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Abstract of the Dissertation 



Strongly Interacting Higgs Sector Without Technicolor 

by 

Chuan Liu 
Doctor of Philosophy in Physics 
University of California, San Diego, 1994 
Professor Julius Kuti, Chairman 

The theoretical framework for the higher derivative O(N) scalar field theory 
is established and the theory is shown to be finite and unitary with the indefinite 
metric quantization. It has been shown that if the ghost states are represented by 
a complex conjugate pair, the theory is free of any logical inconsistencies and the 
ghost pair can easily evade the experimental tests. 

With an underlying hypercubic lattice structure, the higher derivative 0(4) 
model is studied nonperturbatively in computer simulations. The Higgs mass bound 
problem is also studied within the framework of higher derivative theory. A much 
higher Higgs mass value in the TeV range is found with the ghost pair well-hidden in 
the multi-TeV range. Therefore, the higher derivative 0(4) model can incorporate a 
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strongly interacting Higgs sector without introducing more complicated structures, 
like technicolor, which was impossible for the conventional lattice scalar model. This 
means that, although the added higher derivative term is a higher dimensional op- 
erator, it changes the fundamental features (metric, energy spectrum, strength of 
interaction, etc.) of the theory so much that we can no longer view it as an irrel- 
evant operator in the Lagrangian. Moreover, due to the strong interaction of the 
theory, it would be impossible to meaningfully define the scaling violation in the 
higher derivative 0(4) model. This implies that we will not be able to set up the 
Higgs mass bound in this theory unless a new nonperturbative interpretation of the 
Higgs mass bound is developed. 



xiii 



Chapter 1 
Introduction 



1.1 The Higgs Sector of the Minimal Standard 



The Standard Model was first introduced in the late 1960's to unify the 
electromagnetic and weak interactions [1, 2, 3]. The symmetry group of the Standard 
Model is SU(2) L x U(l)y- The Minimal Standard Model corresponds to taking only 
one Higgs doublet in the basic representation of SU(2). The action of the Standard 
Model consists of several sectors which are coupled together. The action of the Higgs 
sector for the theory can be written as: 



Model 




(i.i) 



where is a SU (2)-doublet Higgs field, 
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In the limit of small gauge coupling and Yukawa coupling, the Higgs sector decouples 
from the rest and becomes a 4 type scalar field theory with a global symmetry 0(4). 
This limit is also referred to as the 0(4) limit of the Minimal Standard Model. In the 
Standard Model, the symmetry SU(2) L x U(l)y is spontaneously broken to U(l) em , 
which, in the 0(4) limit, corresponds to the symmetry breaking 0(4) — > 0(3). In 
this limit, the Higgs mass and the vacuum expectation value v are related by 

m H = V8\v, (1.4) 

where A is the renormalized coupling constant. The experimental value for v is fixed 
to be v ps 250 GeV. Therefore, the ratio m H /v also characterizes the strength of the 
quartic self-interaction. 

The 0(4) limit of the Standard Model is a very interesting limit to study 
for two reasons. First of all, the 577(2) gauge coupling is found to be very small, 
g 2 ps 0.4. Therefore, the effects of the gauge fields on the Higgs sector is perturbative. 
Although the mass of the top quark remains to be determined, it is unlikely that the 
top mass will be much higher than 200 GeV. All the quark masses are rather light 
compared with the weak scale, therefore the effects of the fermion sector can also 
be evaluated within perturbation theory. In other words, the symmetry breaking 
mechanism is almost completely determined by the Higgs sector alone, plus some 
perturbative corrections. Secondly, on the phenomenology side, people have shown 
a so-called Equivalence Theorem [4] which says: when the center of mass energy y/s 
is much higher than the W boson mass, the scattering amplitude of the W bosons 
in the full Standard Model is equal to the scattering amplitude of the corresponding 
channel in the 0(4) model with 0(m w / ^/s) corrections. One of the methods in 
the Higgs search experiment is utilizing the WW or ZZ boson scattering channel. 
Therefore, if we consider the energy range for the Higgs search, assuming that the 
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Higgs mass is above the vev scale or even in the TeV range, the 0(4) limit would 
be a good approximation for the WW scattering in the Standard Model. Thus, we 
conclude that the 0(4) limit of the Standard Model would be a very good laboratory 
for the investigation of the symmetry breaking mechanism and the mass of the Higgs 
particle. 

1.2 The Triviality Higgs Mass Bound 

Despite the successes of the Standard Model, two types of particle that are 
important in this model remain unconvinced by the experiments, namely, the top 
quark and the Higgs particle. The missing of the Higgs particle is very problem- 
atic because the Higgs plays such an important role in the spontaneous symmetry 
breaking which gives rise to all the masses of gauge bosons and fermions. In the past 
decade, there have been many efforts to put an upper bound on the Higgs mass. The 
early works utilized the tree level unitarity and the unitarity bound was found to be 
around 1 TeV [4]. Later, it was then realized that the 0(N) scalar field theory is a 
trivial field theory and this implies an upper bound on the Higgs mass [21, 22]. 

The triviality picture of field theory was first encountered by Landau et. al. 
long ago when studying the renormalization properties of Quantum Electrodynamics 
(QED) [20]. They discovered that, if the cutoff was brought to infinity in QED, 
the renormalized coupling constant of QED (the electric charge) was driven to zero 
logarithmically. Therefore, in order to have an interacting theory, a large but finite 
cutoff had to remain in the theory. Thus, QED has a built-in cutoff parameter. 
This implies that every quantity calculated in QED depends on this arbitrary cutoff 
parameter. As the energy scale gets closer and closer to the cutoff, there is more 
dependence on this arbitrary cutoff parameter. It seems then that we will lose the 
predictability of the theory. In fact, this is not a problem at all for QED. The 
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built-in cutoff scale, also know as the Landau ghost scale, is enormous (typically 
A ~ 10 137 MeV) for QED and therefore the dependence of the physical quantities 
on this cutoff scale is negligible. Furthermore, before this energy scale is reached, 
new physics (weak interactions, strong interactions) will set in and QED must be 
modified. However, one thing becomes clear from the above discussion, namely, we 
cannot calculate to arbitrary accuracy in a trivial field theory due to the existence 
of the arbitrary cutoff parameter. 

The triviality scenario of the Higgs sector is quite similar to that of QED, 
except that in the Higgs sector we do not know the mass of the Higgs and the coupling 
constant. Therefore, the built-in cutoff for the Higgs sector could be as low as a few 
TeV, or as high as the Planck scale, depending on the value of the Higgs mass. Also, 
we do not know the nature of the new physics that lies between the built-in cutoff 
and the weak scale, if there is any. 

The triviality of the Higgs sector can be easily seen in either perturbation 
theory or in the 1/N expansion [31] of the model. Extensive nonperturbative studies 
have also been performed on this model with a lattice regulator [25, 26, 27]. All 
nonperturbative simulation results suggest that the triviality scenario found in per- 
turbation theory is a feature of the full theory. In these studies, the upper bound of 
the Higgs particle was found to be about 640 GeV under some well defined conditions 
which we now come to. 

With the lattice regulator, the theory is made finite and the momentum cutoff 
is given by A = ir/a. The continuum limit is achieved by taking A — > oo, or equiva- 
lently, taking the lattice correlation length £ — > oo. For very large A/m#, triviality 
of the theory forces the renormalized coupling constant \r to go to zero logarith- 
mically. Since the vacuum expectation value v is fixed in physical units, this would 
mean the Higgs mass is also going to zero in this limit like rnn ~ (log(A/m//)) -1 ^ 2 - 
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Making a larger Higgs mass is therefore equivalent to bringing down the cutoff A rel- 
ative to the Higgs mass. Of course, this will generate larger cutoff dependent terms 
(scaling violation) in the physical scattering cross section. In the case of the lattice 
cutoff, the scaling violation is represented by the violation of Euclidean invariance. 
The old triviality Higgs mass bound was obtained by demanding that in a Goldstone 
scattering process (which is equivalent to WW scattering in the Standard Model 
according to the Equivalence Theorem), there was not more than a few percent Eu- 
clidean invariance violation in the scattering cross section [26]. It is evident from the 
above discussion that two things are crucial to set up the triviality mass bound of 
the Higgs particle. First, one has to know what the scaling violation will be when 
a certain type of regulator is introduced. Second, one has to have a well-defined 
method to calculate this scaling violation for a given set of parameters. In the case 
of the lattice Higgs bound study, the scaling violation is the Euclidean invariance 
violation and the method to calculate it is perturbation theory. Perturbation theory 
is a valid approach for the Higgs sector, because for all Higgs mass values below the 
bound, the coupling is weak enough for meaningful perturbative expansion. 

The old triviality Higgs mass bound was rather low because even at the upper 
bound value the renormalized coupling constant of the theory remains perturbative. 
In terms of the Higgs mass over vev ratio, R is only about 3. Further increase to 
the Higgs mass results in a scattering amplitude with large lattice effects and can 
no longer represent the low energy continuum theory. Therefore, if the hypercubic 
lattice will not be the new physics, then the existence of a strongly interacting Higgs 
sector is excluded in a lattice regulated scalar field theory. There has been great 
concern that this finding was an artifact of the lattice regulator itself which breaks 
Euclidean invariance. This concern is reasonable if we consider the analogue in 
QCD. We know that the linear sigma model, which is nothing but the 0(4) model 
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in the broken phase, will generate the right physics of QCD at low energies (low 
energy theorems, PCAC, etc.). However, the corresponding ratio m a j f n ~ 7 is 
much higher than in the Higgs case. Based on this analogy, technicolor models 
have been introduced which offers a possibility of strongly interacting Higgs sector. 
Due to the strong interacting nature of the technicolor at low energies, perturbation 
theory breaks down. Most of the analytic calculations are therefore performed using 
the effective chiral Lagrangian methods. A complete nonperturbative simulation of 
the technicolor theory including the dynamical fermions is very costly. Therefore, it 
would be nice to have a scalar model that can incorporate a strongly interacting Higgs 
sector. People have tried to perform the lattice calculation with better Euclidean 
invariance for the scalar models. The first significant increase of the Higgs mass 
bound (750 GeV) was reported [29] within the Symanzik improvement program on a 
hypercubic lattice structure [28]. Similar results on different lattice structures, with 
higher dimensional lattice operators in the interaction term, have also been reported 
[30]. 

In this thesis, I will study the scalar sector of the Minimal Standard Model 
and the Higgs mass problem by adding a higher derivative term in the kinetic energy 
of the Higgs Lagrangian. With the higher derivative term, we have a finite O(N) 
scalar field theory interacting via a quartic coupling constant. 

1.3 Higher Derivative Field Theory and Indefinite 
Metric Quantization 

There have been serious concerns about the potential difficulties in higher 
derivative field theories [5, 6, 8, 9, 10, 11, 12, 14, 16, 17, 18, 19]. I will briefly 
mention some of these difficulties in this section, and the detailed study will be the 
subject of the subsequent chapters. 
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First of all, as we will see in Chapter (2), the conventional quantization pro- 
cedure does not offer a meaningful theory because the spectrum is neither bounded 
below nor above. So, finding new ways of quantizing the higher derivative theory is 
necessary. One of the choices is the indefinite metric quantization [13, 14, 11]. By 
doing this the theory has a unique vacuum but, in the meantime, the positivity of 
the norm in the Hilbert space is lost. Therefore, one has to identify a subspace in 
the full Hilbert space as the physical space and maintain all the physical principles. 

Unitarity is one of these principles that one would like to maintain because 
this is at the heart of any quantum theory for which Born's probability description 
still applies. Before any meaningfully interpretation of negative probability is found, 
unitarity should be preserved in any physical theory. This is a big challenge for 
the higher derivative theories simply because the full Hilbert space is not positively 
normed, and negative normed states, also called ghost states, may violate unitarity. 
This is one of the main reasons why many people have abandoned the higher deriva- 
tive field theories. However, I will demonstrate that, there could be a scenario in 
which the ghost particles are represented as a complex conjugate pair, and unitarity 
is maintained [9, 11, 15]. This possibility was first pointed out by T. D. Lee in the 
late sixties. There have been a lot of discussions on this issue and it still remains 
quite controversial. 

Causality is another principle of the physical theory. As has been pointed out 
earlier by Lee [9], with the complex conjugate ghost pair, only microscopic causality 
is violated, and macroscopically it is very difficult to detect in the experiments (see 
Chapter (3) for further information). 
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1.4 Higgs Mass Problem in Higher Derivative 
Scalar Field Theory 

It is very interesting to study the Higgs mass bound problem in this higher 
derivative scalar field theory. There have always been several ways of viewing this 
theory. The first and most conventional way is to view it as the Pauli-Villars regu- 
lated Higgs theory [7, 11, 23, 24]. The second is to view it as a stand-alone, finite, 
well-defined theory with ghosts. The third is to view it as some truncated expan- 
sion of the effective low energy theory after the degrees of freedom representing the 
new physics have been integrated out. The original full theory probably has no ghost 
states, but after the truncation of the full series, the model may contain ghost excita- 
tions. Obviously, the distinction between the second and the third view is ambiguous 
since we do not know what the full theory should be. The first view, however, should 
be taken very carefully. Strictly speaking, this view is only valid in the limit of small 
m/M ratio, where m is the Higgs mass and M is the Pauli-Villars mass parameter. 
If the mass of the Higgs is getting close to the Pauli-Villars mass parameter, we have 
to take the second view and treat the theory as a finite theory with ghosts. In the 
limit of m/M — * 0, this finite theory coincides with the conventional O(N) scalar 
field theory with a Pauli-Villars cutoff. When the Higgs mass scale is comparable 
with the ghost parameter, the higher derivative field theory becomes a theory with 
complicated particle contents. 

To study the Higgs mass bound problem in the higher derivative O(N) model, 
we have to answer the same two questions. First, what is the scaling violation; 
second, how does one calculate it? 

The answer to the first question is not easy in the case of the higher derivative 
theory. Naively thinking, one would expect there should be some ghost effects. 
However, despite the negative metric ghost states in the theory, it remains unitary 
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and the scattering cross section of ordinary particles looks perfectly normal (see 
Chapter (3) for details). The only unusual effect found for the higher derivative 
O(N) model is the violation of microscopic causality. As has been mentioned above, 
this type of acausal effect is extremely difficult to detect. That is to say, introducing 
the higher derivative terms to the theory makes the theory finite, only at the cost of 
violating microscopic causality, which is invisible for practical reasons. One might 
still worry that, in this theory, all the results will depend on the ghost mass parameter 
and this is some sort of scaling violation. This leads us to the second fundamental 
question of the problem, namely, how to calculate the scaling violations. 

Obviously, if the Higgs particle remains light and the theory is still in the 
perturbative regime, we can do the perturbative calculation and find out how the 
scattering amplitude depends on the new parameter M. Whether to call it the 
scaling violation is still a question. It is some deviation from the Minimal Standard 
Model in the perturbative range. However, if the Higgs is heavy and the interaction 
is getting stronger, we will not be able to find out the scaling violation simply because 
we have nothing to compare with. In a strong interacting theory, we have no idea 
what the universal scattering amplitude will look like. In fact, we do not know how 
to define such a quantity meaningfully. A new nonperturbative interpretation of the 
Higgs mass bound therefore becomes necessary. 

From the above discussion, we can see that there are several major differences 
between the higher derivative theory and the conventional theory on the lattice in 
regards to the Higgs mass bound problem. First, the scaling violation in the con- 
ventional theory with the lattice regulator is unambiguously defined, both pertur- 
batively and nonperturbatively. Even without the help of the perturbation theory, 
we can quantify the violation of the Euclidean invariance meaningfully [32]. In the 
higher derivative case, however, the scaling violation is not well-defined, at least not 
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nonperturbatively. One can try to search the M dependence of the theory only in 
perturbation theory. 

Although the higher derivative theory is a finite theory, it still has infinite 
degrees of freedom. In order to carry out a nonperturbative simulation of the model, 
one must make the number of degree of freedom finite. This can be done by intro- 
ducing an underlying hypercubic lattice structure to the model. The lattice spacing 
a introduces a new short distance energy scale with the associated lattice momentum 
cutoff A = ir /a. In order to recover the higher derivative theory in the continuum, 
one would have to work towards the limit A/M — ■> oo with a fixed ratio of M/ ma- 
in so doing, the higher derivative O(N) model has the same scaling violation as the 
conventional model, that is, it violates Euclidean invariance. In the lattice higher 
derivative model, in order to recover the corresponding continuum model, one only 
has to eliminate the scaling violation that is associated with the lattice. For the 
higher derivative model on the lattice, one can view it as the conventional model on 
the lattice plus some so-called higher dimensional (or irrelevant) operators. 

Recently, Neuberger et. al. [30] reported a new Higgs mass bound based 
on the systematic search in all the dimension 6 operators added to the conventional 
Higgs model on the F 4 lattice. Based on their study, they claim that the triviality 
Higgs mass bound is m# = 710 ± 40 GeV, and this bound value is universal in 
the sense that no other higher dimensional operators will change it. However, our 
model discussed herein contradicts their conclusion. Our model can be viewed as the 
conventional model plus one dimension 8 operator, which is supposed to be irrelevant 
according to their study. However, from all our simulation results, we can easily drive 
the Higgs mass value into the TeV range (see Chapter (5) for more details). We 
believe that the so-called "irrelevant operators" are not irrelevant at all, at least not 
for the Higgs mass bound problem. After all, by adding new dimension 6 irrelevant 
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operators, Neuberger et. al. have found a rather different bound. Therefore, the 
notion of irrelevant operators is a very misleading one as far as the Higgs mass bound 
problem is concerned. As we discussed above, in our model, it is impossible to set 
up a precise Higgs mass bound due to the strong interaction. However, the model 
is able to accommodate a Higgs particle which is heavier than the old Higgs mass 
bounds with no lattice scaling violations. 

My thesis is organized as follows: in Chapter 2 , the quantization of the higher 
derivative theory is established using indefinite metric quantization. In Chapter 3, 
the higher derivative O(N) model is studied within the framework oil/N expansion 
and the important issue of unitarity and causality are also discussed . In Chapter 
4, the lattice version of the higher derivative field theory is presented and the pos- 
sibility of nonperturbative studies using Monte Carlo simulation is discussed, and 
the symmetry breaking mechanism in the finite volume is studied within the Born- 
Oppenheimer approximation. In Chapter 5, numerical results of the simulation are 
presented and analyzed. These simulation results demonstrate that the interaction 
of the higher derivative scalar field theory is much stronger than the conventional 
scalar field theory. Therefore, a heavy Higgs particle in the TeV range becomes a real 
possibility in the theory. Chapter 6 discusses the extraction of the resonance param- 
eters of the unstable Higgs particle in the finite volume using finite size techniques. 
This method, first suggested by Liischer [33, 34], has proved to work very well for 
the conventional O(N) model [35, 36]. We demonstrate that this also works in the 
higher derivative O(N) model after appropriate adjustments. In fact, we believe this 
is the only sensible way to extract the mass parameter in the simulation of a strongly 
interacting theory. 
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Chapter 2 



Higher Derivative Field Theories 
and Indefinite Metric Quantization 

2.1 Higher Derivative Oscillator 
2.1.1 Classical Hamiltonian 

Many important features of higher derivative field theories can be illustrated 
by their simple quantum mechanical counterparts. As an example, let us first study 
a higher derivative oscillator [8] given by the following Lagrangian 

1 ,„ n m 2 „^x.9 ,cos20 m 2 . 1 ...2 m 2 , , 
L = - 1 + 2— cos 29 A 2 - — + - T7 t)x 2 + —-X - —x 2 . 2.1 

2 M 2 ' y M 2 2M A> 2M 4 2 v ; 

This Lagrangian describes a simple harmonic oscillator of frequency m with second 
and third derivative terms added. For simple interpretation of the spectrum, the co- 
efficients of the derivative terms are given in terms of M and G; the only restrictions 
imposed are m/M < 1 and < < n/2. With the higher derivative terms added, 
this Lagrangian produces new features that are not present in the conventional the- 
ory. Classically, one can look at the time evolution of the position x(t) which is a 
solution of the corresponding Euler-Lagrange equation 

m 2 n ^,d 2 x ,2 cos 20 m 2 .dfix , , A d e x 9 ^ . . 

il + 2 W C032e) lF + l-ur- + + M IF + mx = °' (22) 
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Some of the new features of the higher derivative theory already appear at the 
classical level. For example, in order to specify the solution, one has to know more 
initial conditions than in the usual theory. In this particular example, one needs 
to know x^(0),n = 0, 1, • • • , 5 to specify a unique solution, where denotes the 
n-th time derivative of the variable x. This in fact tells us that the higher derivative 
theory has more degrees of freedom than the conventional theory. Another new 
feature is that there are runaway solutions to this classical equation of motion [2]. 
The Hamiltonian of a higher derivative Lagrangian was worked out long time ago by 
Ostrogradsky [1]. In the Hamiltonian formalism, new degrees of freedom show up 
explicitly due to the higher derivative terms. In this particular example, there are 
three independent coordinates and their corresponding conjugate momenta, given by 

qi = x, q 2 = x, q 3 = x, 



1,_ „m 2 . ,cos20 m 2 . ... 1 



cos 20 m 2 ... 1 -. . . 

= -hi^ + 2M^-2M^' (2 ' 3) 



?3 = 

Notice that pi is not proportional to x any more. Instead, both x and x become 
independent variables. In terms of these variables the Hamiltonian reads 

M A o 1,_ m 2 9 .cos 20 m 2 , 9 m 2 9 . t . 

H = Piq2+P2q3 + — P ^--(l + 2—cos2Q) q2 2 + (^^ + — )q 3 2 +— q x 2 . (2.4) 

The classical equation of motion can be written out in the Hamiltonian form 

d_ _ dE_ d_ _dE_ 
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where % runs from 1 to 3. It is easy to verify that the Hamilton equations of motion 
are equivalent to the Euler-Lagrange form, once we have expressed everything in 
terms of qi(t) = x(t) and its time derivatives. Note , however, that this Hamiltonian 
does not look like the conventional Hamiltonian at all. The limit of small m/M is a 
singular limit, and we will not be able to recover the standard oscillator Hamiltonian 
by taking this limit. 

2.1.2 Quantization 

Let us now try to quantize this Hamiltonian with the conventional canonical 
method. We will treat qi, q 2 and q 3 as independent variables and they have the usual 
commutator with the corresponding momenta 



with other commutators vanishing. This already says something unusual about this 
quantum theory, namely that the position of a particle and its velocity are indepen- 
dent variables and can be measured simultaneously; while in conventional quantum 
mechanics they form a conjugate pair and cannot be measured simultaneously. In 
the higher derivative theory, it is the quantity pi that cannot be simultaneously 
measured with qi . From the expression of p\, it implies that the measurement of x 
together with x , x together with x and 'x together with x are impossible. 

It is not very easy to see that the quadratic Hamiltonian in Equation (2.4) 
still represents the oscillator spectrum. In fact, using a linear transformation, the 
quadratic part of the Hamiltonian can be diagonalized exactly 



with M. = Me %@ and M = Me lB . The creation and annihilation operators ap- 



[ft,Pj]=i<^ i,j = 1,2,3, 



(2.6) 




(2.7) 
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peared in the above equation are linear combinations of qi and pi and satisfy the 
following standard commutation relations: 

[a,at] = [6,6t] = [ c>c t] = i. (2.8) 

The other commutators all vanish. This type of spectrum has many problems [8]. It 
is not bounded below, not even the real part. Therefore, no ground state exists in 
this theory. This unboundedness is a very common feature to all higher derivative 
quantum theories. It is a direct reflection of the the "wrong sign" in front of one of 
the quadratic terms in the Hamiltonian. One way of dealing with these problems is to 
try another quantization procedure and this is where indefinite metric quantization 
[3, 4, 7] comes in. 

The idea of using negative metric in the quantization procedure was intro- 
duced long ago, especially for the quantization of gauge fields [7]. In this framework, 
the full Hilbert space is too large for physical interests. It contains negative normed 
states which are necessary for the consistent quantization. The negative normed 
states must be removed from the physical subspace to maintain the probability in- 
terpretation of the theory. We will apply the same idea here [5]. 

First, notice that by appropriate scaling: qi — > pq\ and q 2 — > qij p, with 
p 2 = 1 + 2m 2 cos26/M 2 and by the change (p 2 ,q 2 ) — ► (—92,^2), we can rewrite the 
Hamiltonian into the following form 

H = p lP2 - I + I - — q 2 q 3 + \{m 2 + 2M 2 cos20)g 3 2 + \m 2 p 2 q\. (2.9) 
A A p A A 

Now make the substitution 

p 2 -> +ip 2 , q2 -> -iqz- (2-10) 
This will not change the commutator of q 2 and p 2 and we may write the Hamiltonian 
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as 



H = \PI + \PI + \PI + lQ T MQ, 



(2.11) 



where the P's and Q's are related to original variables by the following table 

Pl=Pl, P2=P2 + ipi, P3=P3, 

Qi = qi ~ iQ2, Q2 = ?2, Qs = 93- (2-12) 
We have used the matrix notation Q and Q T and the mass matrix M is 



M 



V 



2 2 -22 

m p im p 





22 22 




imp —mp 


i — 

p 


iMl 


m 2 + 2M 2 



(2.13) 



Negative metric quantization corresponds to demanding that the p's and g's are 
hermitian, so the Hamiltonian (2.11) itself is not hermitian. Rather, it is self-adjoint 
with respect to a metric operator r\ satisfying 



m2 rj = - q2: 



VP2V = -P2, 



(2.14) 



V 2 =l, 



7] = 7] 



t 



In this indefinite Hilbert space, the inner product of any two states, \ip) and 
is defined to be {^\r]\(J)). It is easy to show that the expectation value of any 
self-adjoint operator is real in any states. Therefore, the expectation value of the 
Hamiltonian in any state is real, although the eigenvalues of the Hamiltonian may 
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be complex. This immediately implies that the complex energy eigenstates have 
zero norm. The dynamics of any state vector are still governed by the Schrodinger 
equation 

i^|*(f)> (2.15) 
It is easy to show that the norm of a state is still preserved under time evolution. 

2.1.3 Diagonalization 

We can now perform transformation of the variables Q and P according to 
a "rotation" A 

Q = AQ, P = AP, 

A T A = AA T = 1, (2.16) 

and diagonalize the mass matrix M. The eigenvalues of this matrix are simply 
given by m 2 , M 2 = M 2 e 2te and M. 2 = M 2 e~ 2l& . This is why we chose compli- 
cated parametrization of the Lagrangian. Therefore, we can define the creation and 
annihilation operators as 
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These operators satisfy the standard commutation relation 

[aH, fl W] = [&("),&(+)] = [c(-\ c< + >] = 1, (2.18) 
and the Hamiltonian has the standard oscillator form 

H = (a^V^ + \)m + (6 (+) 6 ( - } + \)M + (cW^ + \)M. (2.19) 

jL jL Z 

The ground state is defined to be the state which is simultaneously annihilated by 
a (-) j fr(-) anc i c (-)_ We assume that the ground state is positively normed to 1. Neg- 
ative metric is seen from the adjoint relations among the creation and annihilation 
operators 

V a ( '^r] = a i+ \ rjb^rj = c (+) , rjc^r) = &(+>. (2.20) 

We can then build up our full Hilbert space by applying the various creation operators 
to the ground state. The eigenvalues of the Hamiltonian can, in general, be complex 
if the complex ghost pair is not evenly excited. All the eigenstates with complex 
energy have zero norms. This is a common feature for all self-adjoint Hamiltonians. 
The excited states are constructed and normalized according to 

f a (+))n a f b (+))n b ( c (+))n c 

\n a ,n b ,n c ) = V ' y ' y ' 10,0,0), 
VnJ \/n b \ y/n c l 

(n' a ,n' b ,n' c \r)\n a ,n b ,n c ) = 8 na ,n> a K b ,n> c K c ,n> b - (2.21) 
2.1.4 Ground State Wave Function 

We can work out the coordinate space wavefunction for the ground state by 
substituting the old variables. We get 

(i m 3 sin 50 i m 5 sin36> 
-y -j PCs2^ e g ' 
^ 1 Z M 2 cos Z.VJ -f- M 4 
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/m i M| sine _ 1 

2 l-2f|cos2e + fi y2 



™ ^4 1 M 3 sin 56 , M 5 sin 36 

" t \ x to 3 sin 26 _ m 5 sin 26 „2 

2 l M 4j i_2^cos26 + ^ 3 



j m sin 36 m 3 sin 6 



+ m ^%26 M* sm20 / • ) 

1-2^ cos 29 + ^^ 



3 1 _i_ M 3 sin6 _ M sin 30 
'' t ' m 3 sin 26 m sin 20 

M3 i_ 2 ^cos20 + ^ 



(«925s) 



m 3 l_(m + M\ sin0 \ 

+ iSSi-aSccae + fil^J- (2 ' 22) 

In order for the ground state to be normalizable, some constraints must be put on 
the parameters M/m and 0. First of all, the normalization condition is somewhat 
different in the case of indefinite metric quantization. The condition is 

(0M0) = (0|77ki,?2,? 3 )(?i,?2,?3|0) = 1, (2.23) 

where we have omitted the sum (integration) over the q^s. The ground state wave 
function given above is just (qi, q 2 , ffeMO). Therefore, due to the existence of 77 which 
flips the sign of q 2 , the normalization condition for the ground state wave function 
is written as 

J d qi dq 2 dq^*{q u -q 2 , q 3 )*(qi, q 2 , Qs) = 1- (2.24) 

Now we can write down the sufficient condition for this Gaussian type integral to 
converge. Since the quantity 1 — 2(m 2 /M 2 ) cos 26 + m 4 /M 4 is always positive, the 
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condition for normalizability reduces to the following 

/o(m/M,6) >0, f (M/m,Q) > 0, 

A (m/M, 0) > 0, fo(m/M, 0)/ o (M/m, 0) - A (m/M, 0) 2 > 0, 

„ , , o sin 50 5 sin 30 

sin 20 sm 20 

^• e ) = ^ + ^ST2e- L < 2 ' 25 ' 

The condition A (m/M, 0) > is equivalent to the condition < < ir/2. In order 
to fulfill the other conditions the parameter pair (m/M, 0) has to be in some range. 
In Figure (2.1), the function f (x, 0)/ o ((l/x), 0) — fi(x, 0) 2 is plotted as a function 
of for some values of x = m/M. Since this combination is symmetric with respect 
to the change x — > (l/x), it is sufficient to study the behavior in the parameter 
range < x < 1. It is seen from this figure that for any value of the parameter x, 
there exists a critical value Q c (x) below which the ground state normalizability is 
preserved. In Figure (2.2), this function is plotted in the whole range < x < 1. As 
a result, if we restrict the angle to be less than about it/ 3, the ground state wave 
function is normalizable for all values of m/M. 

It is useful to have the expression of qi in terms of the creation and annihi- 
lation operators: 

<?i = 



^/(i-^xi-if) v^J(i -$x-i+$: 



c (+) + c (-) 

+ _ , (2-26) 



v / 2A? 1 /(l-^)(-l + ^ 
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0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 

20/tt 

Figure 2.1: The expression fo(x, 0)/o((l/x), 6) — 6) 2 , as given in the above 
equations for various values of x is plotted versus the variable 2Q/n. This combina- 
tion is always positive for a given value of x as long as is less than some critical 
value c (:r). 

This concludes our discussion of the higher derivative oscillator. 

Note that, if we have an extra term — A x 4 in the starting Lagrangian, then 
our Hamiltonian would consist of two parts, H = H + Hi, where H is just the oscil- 
lator Hamiltonian discussed above and Hi = X qf with qi given by Equation (2.26) 
. Thus, the oscillator gives us a good starting point for perturbation theory. 
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0.2 0.4 0.6 0.8 1 

x = m/M 

Figure 2.2: The critical value <d c (x) is plotted as a function of x — m/M. All the 
parameter pairs (x, 0) below this curve will ensure the normalizability of the ground 
state wave function. 

2.1.5 Euclidean Path Integral 

Now let us evaluate the partition function of the higher derivative theory 
defined by 

Z = Tre^ H = ^(s\rie- pH \s), (2.27) 

s 

where the summation is over all states \s) such that they are complete: 

J2\s)(s\ V =l. (2.28) 

s 

One convenient choice for the states is \qi,q2,Q3)- Then one can make use of the 
derivative forms of the momentum operators and derive a path integral form for the 



26 



partition function, just like in the usual theory. First one has to slice the Euclidean 
time (3 into small intervals and, then, the partition function is written in terms of the 
integration of the intermediate positions. This path integral form of the partition 
function is exactly the Euclidean path integral that one would naively write down 
when not concerned with the canonical quantization procedure [6, 10] 



Z\J\ = J Dqe W (- iyrL E [q(r)] + J(r)q(r)y 



l._ n m 2 . 9 ,cos20 m 2 . 1 •■■ 2 m 2 9 _ 

The Euclidean propagator of the variable q{r) can be found by differentiating the 
partition functional with respect to the external source J(r). In Fourier space, it is 
given by 

M 4 

D e( e ) = jjp + m 2^ E 2 + M 2 e 2ie )(E 2 + M 2 e 2lB ) ' ^' 3 °^ 

The multiple pole structure in the propagator is a manifestation of the spectrum of 
the theory. As can be seen clearly, the poles are located exactly at three types of 
energy gaps of the theory. 

There is a big difference here in the higher derivative theory as compared with 
the usual theory. The Minkowski path integral [5, 9] is not well defined. In fact, due 
to the complex ghost energy, it has runaway modes at large temporal separation. 
Also, we cannot do a wick rotation from the Euclidean to the Minkowski because of 
the complex ghost pole on the first sheet. We should emphasize that the Euclidean 
path integral is still well defined. This is the object that we will be using in our 
numerical simulation of the theory. Also, the Euclidean path integral in principle 
contains all the information about the higher derivative theory. By measuring the 
Euclidean propagator of the theory, one can extract the energy excitations of the 
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higher derivative theory and, hence, the eigenvalues of the Hamiltonian. 

2.2 Higher Derivative Free Field Theory 

Having discussed the quantum mechanical oscillator, let us now turn to the 
simplest higher derivative field theory, free field theory. Since most of the procedures 
are quite similar to the quantum mechanical case, we will be very brief in this section. 
Consider the one component higher derivative scalar field theory parametrized by 
the Lagrangian 

C = l -ct>{x){-U - mg)(l + + ^jiMaO, (2-31) 

where the □ is the Minkowski d'Alambert operator. The Hamiltonian density can 
be obtained in the same way as in the quantum mechanical example 

M 4 1 1 

H = 7Ti0 2 + ^203 + ~ ^^{pi ~ 2 PsV 2 + 3p 3 V 4 )0 2 + ^h(P2 ~ 3p 3 V 2 )0 3 

+^0i(-PiV 2 + p 2 V 4 - p 3 V 6 + m 2 )0i. (2.32) 

Again, we can interchange the role of 7r 2 and 02, which amounts to 02 — > 7r 2 and 
7r 2 — > —02 • We also impose negative metric on 7r 2 and 2 by doing the substitution 
2 — > — £0 2 and 7r 2 — > +?7r 2 , and after these changes our Hamiltonian density is, 

11 1 

H = mi-K 2 + T^ -7 ^ 2 + ^(pi - 2p 2 V 2 + 3p 3 V 4 )7r 2 + -0 3 (p 2 - 3p 3 V 2 )0 3 

+ ^0i(-PiV 2 + p 2 V 4 - p 3 V 6 + m 2 )0i + 20203- (2.33) 

Negative metric quantization then corresponds to making 0j,i = 1,2,3 and 7r», i = 
1,2,3 hermitian operators. Then the Hamiltonian itself is not hermitian but still 
self-adjoint with respect to the negative metric 77 which flips the sign of 7r 2 and 2 . 



28 

We have 

rjU^rj = H. (2.34) 

Introducing the Fourier modes 

0,(x) = 0, + E Affi^M^ + 0* k e^ k ' x ], (2.35) 
k>o v y 

where the index i runs from 1 to 3. We can also write the similar expression for 7Tj 

tt,(x) =n t + J2 -±=[^e-^ + 7T* k e* k - x ]. (2.36) 
k>o W 

To ensure basic commutation relations, we must have 

[&,k, TTj.k'] = [0*, k , 7rj, k '] = i^kk', Tfj-] = i^ij ■ (2.37) 
We can then write the Hamiltonian as 

H = — (OTi7T 2 + ^— + y7T 2 ) + V{ — (j) 3 + —01 + 2020 3 ) 

+ E <Kkf 2% + <k^2,k) + + (pi + 2p 2 k 2 + 3p 3 k 4 )7r 2ik 7r 2 * ik 

k>0 P3 

+ ( Pl k 2 + p 2 k 4 + p 3 k 6 + m 2 )0 lik 0t k + (p 2 + 3p 3 k 2 )0 3 ,k03,k 

+^(02,k03,k + 0;, k 03,k). (2.38) 

After some rescaling of the variables we can bring the Hamiltonian into similar form 
as in the quantum mechanical oscillator case 

H = H + J2 <k^k + o; k M, ; o /k . (2.39) 

k>0 
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We then perform the same "rotation" transformation as in the oscillator case, and 
then the above Hamiltonian is diaganolized to 



h = h + J2 n* k n Jk + $* k ^ k $ Jk , 



(2.40) 



k>0 



where the frequency w k = \J m o + k 2 , cj lk = V ' M 2 + k 2 and cj 2 k = \J -M 2 + k 2 . The 
creation and annihilation operators are given by 



- 7 =(^$ !k + -^n* k ), 



l ik 



= ^(V^k^k 



/^ik 



n* k ), 



n-k 



1 i 
-WV^ik + ^^iW), 

v ^ v w * 



a (+) 



1 i 

V2 V^ik 



with i = 1, 2, 3. The Hamiltonian finally looks like 

h = E(« l ( k ) «!k ) + o V*, 



(2.41) 



(2.42) 



where the summation is over all the momentum modes and three types of excitations. 
The creation and annihilation operators have the standard commutation relations 



(2.43) 



Similarly the field <f>(x) can be expressed as a linear combination of the creation and 
annihilation operators which will be given explicitly in the next section. The particle 
contents of this free Hamiltonian is now clear. One has three types of excitations 
for each three-momentum k. The operator creates an ordinary particle of mass 
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mo, momentum k and energy c^ok — \f m o + k 2 . The operator creates a ghost 
particle of mass M., momentum k and energy c^ik = V -M 2 + k 2 . The operator 
creates an antighost particle. 

2.3 Higher Derivative O(N) Model in the Sym- 
metric Phase 

The higher derivative field theory can be easily generalized to an O(N)- 
symmetric scalar field theory with a quartic coupling. In the symmetric phase it is 
convenient to parametrize the Lagrangian as 

I 77? 2 

C = -2( 1 + 2 ^ cos20 )^ D ^ 



.cos 26 m 2 , . ,„ 9 ,„ 1 ,„ , ,„ 



2 

a a - A o (0 a a ) 2 . (2.44) 



2 



The Hamiltonian of the theory, after indefinite metric quantization, can be expressed 
in terms of creation and annihilation operators, H = H + H int , where the free 
part of the Hamiltonian is given by Equation (2.42). The interaction part of the 
Hamiltonian is the conventional one, namely H mt = J <i 3 xAo(0 a a ) 2 , where the field 
a can be written as a linear combination of the creation and annihilation operators, 



+ (a£ V* + aij'e^) (2.45) 
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where the values for q are given by the following list: 

Cl =M" 4 \{M 2 -m 2 )(M 2 -M 2 )]-\ (2.46) 

The negative metric is seen from the adjoint relations among the creation and anni- 
hilation operators 

"Op 

a lp 

where 77 is the metric operator satisfying rj — and rf = 1. It is clear that the 
Hamiltonian itself is self-adjoint with respect to the metric 77, i.e. H = 77/^77 = if. 

2.4 Higher Derivative 0(N) Model in the Broken 
Phase 



(-)of (+)a 
V a 0p V = a 0p , 



(-)of (+)a 
Wp »7 = «2p , 



(-)of (+)a 
^2p V = Alp , 



(2.47) 



One starts with the general higher derivative Lagrangian which has a global 
O(N) symmetry 

c = - p 2 n 2 - P ^w + - v (2.48) 
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where □ = <9 2 — V 2 is the Minkowski space d'Alambert operator and the coefficients 
are parametrized as 

ml ml 1 1 ml 1 

Pl = l + W + W p2 = Ml + W + MW> p3= M^- (2 ' 49) 

After the usual steps of indefinite metric quantization, the Hamiltonian has the form 
H = m X + ^7rX + ^(Pi-2p 2 V 2 + 3p 3 V 4 )7r2 

+ ^?(-PiV 2 - p 2 V 4 - p 3 V 6 )0? + ^(p 2 ~ 3p 3 V 2 )^ + i<j>%<j>% 

~ ^0W + A o (0M 2 . (2.50) 

The corresponding 0(N) generators are given by 

Q ab = (x)^(x) - (x)<(x), (2.51) 

which obviously commute with the Hamiltonian. 

Next, the Fourier modes are introduced for each variable 

1 



>?(*) = '■>■■ ■ -= X o" !M t ' k * • • 

k>0 



«*> - ^4 + wS^4 +e+,kx i- (2 - 52) 

The Hamiltonian is brought into the following form: 

+ E <k< + <> 2 a k + + (Pi + 2p 2 k 2 + 3p 3 k 4 K k 4* 

k>0 P3 
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+ ( Pl k 2 + p 2 k 4 + p 3 k 6 )<M£ + (p 2 k 4 + 3p 3 k 2 )0« k 0« k + *02k03k + «03k 
-E^o0W + E^o(0M 2 . (2.53) 

X X 

We will single out the direction of the 0f variable and fix it in some direction in the 
O(N) space. This treatment is only valid in the limit of infinite volume. Strictly 
speaking, in a finite volume, the symmetry is not broken. Therefore, the description 
of symmetry breaking in the finite volume needs more careful study. As we will 
show in Chapter (4), in a very large but finite volume, one can apply the adiabatic 
approximation (or Born-Oppenheimer Approximation) to the direction of the zero- 
mode. We find that the direction of the zeromode rotates very slowly and decouples 
from the other modes in the theory. Therefore, if the volume is very large, it is 
legitimate to assume that the direction of the zeromode is frozen in the O(N) space. 
With this in mind, we can then decompose 

^1 = vn a + h(x)n a + ^ T (x), (2.54) 



and similarly for the 2 and 3 variables. The value of v is set to y / p^/4Ao. The 
Hamiltonian is then written as sum of three types of terms: 

H = H + i/k^o + H int , 



H k ^ = £ < k 7r 2 a * + i7« k + -7r 3 > 3 - + ( Pl + 2p 2 k 2 + Sps^KkTr- 

k>0 P3 



+ ( Pl k 2 + p 2 k 4 + p 3 k 6 + m 2 )0? kL <& + (pik 2 + p 2 k 4 + p 3 k 6 )0? kT 0^ T 
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H in t = E 4A o^(^ 2 + 0it0it) + Ao(^ + 0it0?t) 2 , (2-55) 

x 

where = 2/i 2 . We will examine each piece separately. 

The k ^ piece can be diagonalized the same way as in section (2.3). The 
interaction piece is also expressed as the creation and annihilation operators through 
the field variables. The H piece is the only one that is new in the broken phase. 
For convenience we use the rescaled variables given by 



Pi 



a I „ \r\ — l/2_a __a Pi I _ \/"\ — l/2„a 

- (pil/J ' 7T 10 , p 2 = W — 7T 20 , p 3 = (p 3 l/J ' 7T 30 , 



9? = (piV) 1 ^?, <& = J-fc q a , = (psV) 1/2 r 3 , (2.56) 

V Pi 

and express the radial variables qf as 

9? = \fpiV(v + a)n a = pn a . (2.57) 
The derivatives for the qf are now substituted by 

hr n 'h (2 - 58) 

where the index a runs from 1 to N. We have assumed that the volume is practically 
infinite and the direction n a is really a constant unit vector in O(N) space. As we 
will see in Chapter (4), this is only approximately true in a finite volume. Use the 
following identity 

ip a lP a 2 = ip 2L Pi P , (2.59) 
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H is further decomposed into two parts 



Ho = H 0L + H 0T , 



Pi „2 



rn 



o „,2 



H 0L = ip 2 LPy + -P 2L + oP 3 L + 7^%L + HI —<l2LqZL + ^V" , 

z i zp 3 y p 3 zpi 



H 



OT 



n P2TP2T + n P3TP3T + o Q3TQ3T + H Q2TQ3T- 
1 1 l P3 V Pi 



(2.60) 



The longitudinal part has the same form as the simple oscillator and can be easily 
diagonalized. The transverse part H t can also be diagonalized with the transfor- 
mation 



q T = AQ T , AA t = A T A = 1, 



(2.61) 



where A is a two by two matrix 



/ -1 1 N 

(l_ e -4ie 9 )l/2 (l_ e +4if 9 )l/2 



V (l- e - 4i «9)i/2 (l_ e +«» 9 )l/2 J 



[2.62) 



The angle 6L is the complex phase of the Goldstone ghost mass parameter M„ = 



\M.g\e ie<) , which is given by 



M 



2 ml + M 2 + M 2 -r- I I, ml M M 



+ iMM\ Pl - 7(77= + =f? + =f?) 2 . 
V A y MM M M 



[2.63) 



The transverse part of the Hamiltonian is then diagonalized to 



TJ \ ^ (+)o ( — )o 

-"OT - 2^ a iOT a iOT ^iOT, 



(2.64) 



where the summation of a is from 1 to N and the energy gap is ujiot = M. g and 
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uj 2 qt = M-g- In terms of these operators we can write out the explicit form of p 2T 



P2T = E 



^OT , f J-)a (+)ax 
a iOT ) 



"€,;, (a. 



(2.65) 



where the polarization factor q is given by t\ = e 2 = — e ^s) 1 / 2 . 

To summarize, in the broken phase we would have the following Hamiltonian 

H = H + H int , 



tit) - 2^ fl ikA a ikA ^ikA, 



#int = Y, 4X 0Vh(h 2 + ^ T 4> a lT ) + X (h 2 + r iT 4> a lT ) 2 . 



(2.66) 



The index A takes the value L and T respectively. All the operators can be expressed 
in terms of the creation and annihilation operators as 



M*) = E 



ikL 



^(x) 



V ° lT (a ( - )a 



(2.67) 



Pl V(v + a) = yJpiVlv + Y, 



where the form factors C; L \ are given by the following table 



CiL ( fl H, fl (+M 



col 



M 2 M' 



\l K - M 2 )(ml - M 2 ) 



ClL = C 2L 



M 2 M Z 



\l (M 2 -m 2 )(M 2 - M 2 )' 



cot 



C\T — C 2 t 



\ (M 2 g - Mg) 



(2.68) 
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The creation and annihilation operators enjoy the following commutation relations 

[4^.^]=W*a'^- (2-69) 
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Chapter 3 



Unitarity and Large TV Expansion 

3.1 Lippmann-Schwinger Equation And Unitarity 

In this section, we will try to answer one of the most important questions 
about higher derivative theories, namely, the unitarity problem [1, 2, 3, 4, 5]. In 
the first part of the discussion, we will set up the general formalism of scattering 
matrix in the higher derivative theory and argue that the S'-matrix defined within 
the physical subspace can be made unitary. In the second part, we will present a 
concrete example of the unitary scattering amplitude in the large N limit of the 
O(N) model which involves the ghost states as intermediate states. 

3.1.1 General Formalism and Unitarity 

Let us imagine that our Hilbert space is built up by all the states gener- 
ated from the vacuum by successive operations of creation operators as described 
in Chapter (2). Some states will have negative norm and complex energy compo- 
nents. We assume that all states available to build the initial state contain only real 
energy components of the free Hamiltonian in all Lorentz frames [2]. We will call 
these states "normal states" or "physical states" . Denote the eigenstate of the free 
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Hamiltonian by \<f> a ) such that 

Ho\<j> a ) = EM, (3.1) 
E a G 3?. 

Then one can construct two states, denoted as |^ + ^) and \^>^), from the Lippmann- 
Schwinger equation 

l#> = l*«> +p ^— V|^>, 

h, a — Hq± ie 

l^> = l^) + jgQ _^ ± ./ |0a). (3.2) 

It is then easy to show that the states |^ ± - ) ) are eigenstates of the full Hamiltonian 
with corresponding energy E a . If we now form wavepackets from these states, one 
can see that they correspond to incoming and outgoing waves in the past or future. 
Therefore, they give us a good description of the scattering process. We can rewrite 
the above equation as 

In this form, it is clear that only energy conserving components of \<f> a ) survive the 
scattering since, if the components are of different energy, they will make the operator 
(E a — H ± ie)^ 1 nonsingular in the e goes to zero limit, hence are killed by the e in 
front. The S- matrix between any two normal states a and (3 is then defined to be 

S Pa = (V^M^) (3.4) 



= { ^E B -H + i*E a - % H + ^ a) - 
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Using the perturbative expansion of the Green's function one can show that the 
S'-matrix element defined above is related to the so called i?-matrix (or T-matrix) 
element by 

S/3a(E a ) = 5p a — 2iri5(E a — Ep)Rp a (E a ), 
R/3a(E a ) = (<t>p\R{E a )\4> a ), 



R(E) = V + V- ^ -R(E), (3.5) 



R(E) = V + V———-V + 



R(E) = V + V 



E — 


H 


+ ie 




1 




E- 


H 


+ ie 




1 


T 



E-H + ie 

To show unitarity, we write the Lippmann-Schwinger equation in a special way 

|#»> = n'*\E a )M, (3.6) 



E a -H + ie E a -H Q ±ie 

where QS + \E a ) and QS~\E a ) are called wave operators. Using the adjointness of 
the Hamiltonian we see that 

\<P a )= V rt±\E a )^\^). (3.7) 
From these two relations we can see that 

r 1 n i± \E a yr 1 ^ ± \E a ) = 1 + P c , (3.8) 
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where P c is the complex energy projector for the free Hamiltonian. Let us now 
consider the sum 

E s Pa s; a = E^M^X^M^) 

= (V^M^) (3-9) 
= (^|fi(-)(^)^Q(-)(^)|0 7 ) 

= (0/3^|0 7 ) = <W 

where, in the second step, we have inserted the complete set of the full Hamiltonian. 
This establishes the unitarity of the ^-matrix. The above proof looks very formal. 
To clearly understand the role of the ghost states in the theory let us calculate some 
scattering processes in the higher derivative O(N) model. 

3.1.2 Large-iV Scattering Amplitude of O(N) Model 

The basic formula is the perturbation expansion of the ^-matrix given by 
Equation (3.5) . We will consider the large- N limit of the geometric resummation of 
the s-channel bubble diagram. We will show how the modified Feynman Rules arise 
naturally from this calculation. 

First, we calculate the i?-matrix elements to second order in bare perturba- 
tion theory of the higher derivative O(N) theory in the broken phase. The final large 
iV scattering amplitude can then be formed from the geometric resummation of the 
bubbles and the large N Higgs propagator. The calculation in the symmetric phase 
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is quite similar. We will parametrize the i?-matrix elements as 



ext 



R a , = -W5\Y,v)\{\l^yM aP) (3.10) 



where the amplitude M. a p is the Feynman amplitude. The lowest order is trivial 
namely 

-M$ = -8A . (3.11) 

To the second order, we have to consider the intermediate states contributions and, 
in the large- N limit, this reduces to only s-channel scattering. This leads us to the 
study of the following one loop contribution, 

M m -Qfi/V^A 2 f ^ ( gg? ^ 2 Kq + ^,P-q) (0 U ) 

Ma, ~ 96N X J ^ {^j—J E2 _ Kq + u; . p _ q)2 + ie ' ( 3 - 12 ) 

in which two types of intermediate states are included, one has energy u itCl + Wj, p - q , 
the other has energy 2E + a; ijq + ^• iP _ q . This form can be brought into the usual 
loop integral form by using the identity 

dq 1 1 —% 2(cui + cj 2 ) 



(3.13) 



ic 2n (g — E) 2 — cuj ql — oj\ 2uji2uj 2 E 2 — (cui + uj 2 ) 2 + ie ' 

where the complex contour is a contour that separates E — uj\ and — oj 2 from E + u\ 
and u) 2 as shown in Figure (3.1). However, this type of contour could have some 
pinching problem [2, 3]. The problem only occurs for the ghost and antighost pair 
contribution in the above equation, i.e. iQ\ = u^. In order to see how the potential 
pinching problem occurs, we have shown the movement of the poles in the complex q 
plane as the center of mass energy increases in Figure (3.1) The appropriate contour 
before the pinching is also shown. It is easy to see that if the center of mass energy 
E is less than the so called ghost antighost threshold u\ + uo 2 = 2Reui, there is no 
pinching and the contour is well defined. As the center of mass energy increases, 
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Figure 3.1: The complex contour of the integration variable q Q . The triangles are 
the poles from one of the ghost antighost contributions. The squares are the poles 
when the roles of iO\ and cj 2 are interchanged. As the center of mass energy E 
increases, the movement of the poles are also shown by the arrows. Pinching occurs 
when E > (uji + uj 2 )- 

four of the eight poles move to the right and two of them pinch with ui\ and u>2- One 
then has to specify how to deform the contour in the case of pinching. This type of 
contour deformation in the presence of possible pinching was also discussed before by 
Cutkosky et al. [3]. They started directly from the integral representation and tried 
to define a contour when the ghost and antighost masses are not exactly complex 
conjugate of each other, namely Mi — M| = iA, where A is some small parameter. 
Then, for every nonvanishing A, they were able to find a suitable contour. The 
final result is defined to be the limit where A — > 0. The corresponding contour is 
shown in Figure (3.2). This prescription has a disadvantage that it is not justified 
with any theoretical consideration. In fact, the Cutkosky prescription is only one 
of the many ways to analyticly continue the integral (3.13). To specify the "right" 
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Figure 3.2: The complex contour of the integration variable q as discussed by 
Cutkosky et al. The pinching is avoided by splitting the ghost and antighost masses 
by a small imaginary amount. 

analytic continuation, one would need some input from the Hamiltonian description 
of the theory. We have started from the Hamiltonian picture of the theory, so the 
Hamiltonian should tell us how to define our contour. Note that pinching only occurs 
when the ie prescription is not applied to the integral. With the ie, however, the 
pinching is avoided for real value of s and we can always find a suitable contour. This 
contour is shown in Figure (3.3). It is clear that our contour differs from the one 
discussed by Cutkosky et al. and therefore, our final result is different from theirs. 
Now the Feynman amplitude can be expressed as 

d 4 q Cj Ci 



M% = -mN\lY,l 



(3.14) 



J Cij (2vr) 4 ql - u;? q + ie (q - Ef 

This amplitude represents the Feynman diagram as shown in Figure (3.4). We can 
shift the integration variable go and after a Wick rotation we can perform the q 
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Figure 3.3: Our complex contour of the integration variable q in the presence of 
possible pinching. The pinching is avoided by using the +ie prescription which is 
derived naturally from the Lippmann-Schwinger formalism of the Hamiltonian. 

integral. The integral itself is finite due to the modification of the propagator at 
large momentum and we are left with an integral with Feynman parameter only 

M% = 9QN\ 2 B(s), 

— 1 r 1 

B(s) = CiCj / dxloglxmf + (1 — x)rrlh — x(l — x)s]. (3.15) 

16~- Xj 

It is very interesting to study the imaginary part of the bubble integral. The 
imaginary part comes only from the angular part of the argument in the logarithm. 
The function being quadratic in x has two roots in the complex x plane which are 
given by 

F(s) = xm\ + (1 — x)m\ — sx(l — x) = s(x — Xi)(x — x 2 ), 
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Figure 3.4: The s-channel one loop amplitude of Goldstone Goldstone scattering. 
The solid lines represent incoming and outgoing Goldstone particles. The dashed 
line can be Goldstone particle, Goldstone ghost or antighost particle. 

x 1)2 = ^-(s - m\ + m\± sj\s - (mi + m 2 ) 2 ][s - (mi - m 2 ) 2 }), (3.16) 

where m x and m 2 can take values of three different masses in the theory. The most 
important combination is when both are Goldstone particles. Then, for s < 0, we 
have two conjugate roots whose real part is exactly 1/2, therefore, just by symmetry, 
there is no imaginary part contribution from this term. This corresponds to the 
case that the center of mass energy is less than the threshold. Due to the massless 
Goldstone, the lowest threshold is at zero energy. However, when s > two roots 
are real and the imaginary part develops, we have 

SB(s + ie) = ^. (3.17) 

In fact one can show, just from the symmetric form of the integral, that this is the 
only imaginary part contribution to the diagram! For example the imaginary part 
from the mass pair M. and M. exactly cancels the imaginary part from the mass 
pair M. and M. and so on. In the large- N limit the scattering amplitude is obtained 
by summing the geometric chain of the bubbles and the tree contribution from the 
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intermediate Higgs state 



N n v 2 IN 

' :^oo 1 (^) = --T777I + ^ + 17^)- (3-18) 



32tt uu w s + s 3 /M 4 8Ao 2 

Taking the imaginary part of this equation, we get the Optical Theorem in the 
large- N limit 

$M 00 (s) = \A 00 (s)\ 2 . (3.19) 

The bubble integral can be exactly worked out. It is a function with rather compli- 
cated analytic structure. The detailed discussion is listed in the appendix. 

The scattering amplitude is obtained by taking s = \s\ + it on the physical 
sheet. It is interesting to study the Goldstone-Goldstone scattering cross section in 
the large- N limit. In Figure (3.5) we plotted the cross section as a function of y/s in 
Goldstone ghost mass unit. Here the Goldstone ghost pair has a complex phase of 
7r/4 and the peak corresponds to the Higgs pole on the second sheet. It is amazing 
that the ghost pair is so well-hidden in the tail of the cross section that it would 
be very difficult for experimentalists to determine that there is a ghost pair hidden 
somewhere. Also plotted in Figure (3.5) is the scattering phase shift as a function of 
center of mass energy. We see that the phase shift starts out increasing with y/s , and 
at the Higgs pole it has a sharp rise. If the Higgs particle were infinitely narrow then 
the rise would be exactly tc. Due to the finiteness of its width and the Goldstone 
background, the cross section differs from the description of Breit-Wigner shape. 
What is "unusual" about this theory is that the phase shift decreases as the energy 
gets through the real part of the ghost mass. This is an indication of possible acausal 
behavior in the scattering, because the sign of d5(s)/ds determines the relative phase 
of the scattered wave to the incident wave. Although for the scattering by a repulsive 
potential in the usual theory, this quantity can also be negative, it would become 
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1 2 3 1 2 3 
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Figure 3.5: The Goldstone Goldstone scattering cross section and phase shift is 
plotted against the center of mass energy in large- N expansion for the Pauli-Villars 
higher derivative O(N) theory. The input vev value is v — 0.07 in M unit. The peak 
corresponds to the Higgs resonance, which is at ran = 0.28 in M unit. The scattering 
cross section is completely smooth across the so-called ghost pole locations. 

acausal if this quantity becomes too large, that is, a sharp drop of S(s) with respect 
to s. In the ordinary theory, this can never happen. In the higher derivative theory 
with the ghost pair, it could happen if the ghost pair is sufficiently close to the real 
axis. It had been argued long ago by T. D. Lee [2] that, even in this case, such acausal 
behavior would only occur at microscopic scale typical of the Compton wave length 
of the ghost, and it will not lead to macroscopic disasters. In fact, this violation of 
microscopic causality is barely visible experimentally. 
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3.2 Large N Expansion of the Higher Derivative 
0(N) Model 

The higher derivative O(N) model can be studied in the large N limit. Many 
aspects of the theory can be illustrated in the large N expansion [6, 7]. The general 
formalism for the large N expansion has been previously studied [9, 10]. Let us 
briefly review the main ideas and focus on its application to the higher derivative 
O(N) model, and also emphasize the comparison of the higher derivative O(N) 
model with the conventional O(N) model within the large N approximation. The 
Hamiltonian picture that originates from the quantization was discussed in the last 
section, so we will only consider the Euclidean version of the large N approximation 
here. 

3.2.1 General Formalism 

Consider the partition function of the theory as expressed by the following 
Euclidean path integral 

Z = Jv<f>e- S M, 

s[j>\ = Jd^rgi-d'w + ^rr + ^irr) 2 , (3.20) 

where the field 4> a (x) is an 0(N) field and the function g(—d 2 ) is a polynomial 
function of the operator (— d 2 ) of the form 

= + C4 ( 9 4 } + c%{ _ 9&) + . . . _ (3 21) 

For example, our choice of the Pauli-Villars theory corresponds to the form of g(p 2 ) = 
p 2 + (1/M 4 )p 6 . To perform the large N expansion, it is convenient to introduce the 
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auxiliary fields x snch that the path integral is rewritten in the following form 

Z = J V</>Vxexp(- J d i x 1 -r(g(-d 2 )+^ + t X )r + ^)- (3.22) 
The effective potential can then be worked out in a standard fashion 

where the variable x is a function of determined from the gap equation 

*="» +4a °* 2+4 WS^f? (3 - 24) 

The vacuum can be found from the derivative of the above effective potential. Due 
to the gap equation, the derivative has the following form 

U'(4>) = 4>X- (3-25) 

Therefore there could be two phases of the theory. One with = 0, x > which is 
the symmetric phase; the other one has ^ 0, x — , which is the broken phase. In 
the broken phase, the vacuum expectation value is obtained via 

°="« +4A °" 2+4 W(iw < 3 - 26 > 

Note that unlike the conventional theory, the integral in the above equation is finite 
as long as we have take the highest momentum power in the propagator to be greater 
or equal to 6. The critical phase transition line is obtained by setting v to zero in 
the above equation, i.e. 

°=^W(Sw < 3 - 27 > 

As we will see in Chapter (4), the large iV prediction of the critical line is in very 
good agreement with the simulation. 
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We can work out the propagator of the fields in the large iV approximation. 
If we are in the symmetric phase, the leading order correction to the propagator is 
just from the mass renormalization. Therefore, we have 

sab 

<0 a (p)0 6 (p)> = if— ~, 

pZ _|_ m z 

m 2 = al + AXoN [ d %- A — -. (3.28) 

Po J (2tt) 4 g(k 2 ) + m 2 K J 

In the broken phase, the Goldstone propagator remains unchanged to the leading 
order but the longitudinal Higgs propagator is modified by the bubble summation 
of the Goldstone intermediate states. Thus, we have 

8\ v 2 



IW) = g( P 2 ) + 



l + AXoNBip 2 )' 



r d 4 k 1 

B ^ - iw m=mm - (3 ' 29) 

The scattering amplitude can be worked out in both the symmetric and broken 
phase. In the symmetric phase, 

N t _ u 2 v 1 . N + 8 



32tt uu ^ ' 24A n 6 



2 r d 4 k 1 

^ P > = J (2tt) 4 (g((k - p) 2 ) + m 2 ){g{k 2 ) + m 2 ) ' (3 ' 30) 

where m 2 is related to the bare mass parameter /j,q according to Equation (3.28). If 
we define the scattering amplitude at p 2 = to be — 3iV/47rA_R, we can express the 
above equation in terms of the renormalized coupling constant Ar 

N 2 , 1 . iV + 8, 



53 



In the broken phase, we have 



A 7 



4.V) 



1 N f d 4 k 1 



(3.32) 



32vr 



8A + 2 J (27i)* g((k- P ) 2 )gm + g(p 2 Y 



Note that although we are dealing with the Euclidean scattering amplitude here in 
large N, it should be understood as the amplitude arising from the Hamiltonian 
formalism described in the previous section. As long as the correct complex con- 
tour integration is implemented, and the analytic properties of these amplitudes are 
understood, the Euclidean amplitude will also give us the correct physical picture. 

We can also modify the above formalism to the theory on the lattice in a finite 
volume. All we have to do is to change the integral into finite lattice summations. 

Let us now show some examples of the application of the large A" results and 
see what we can learn from it. 

3.2.2 Renormalized Coupling Constant in the Symmetric 



In the first example, we compare the higher derivative O(N) model on the 
lattice and the conventional O(N) model on the lattice in the symmetric phase. As 
described in Equation (3.31), the renormalized coupling constant \r can be defined 

as 



where everything is measured in lattice unit. In this formula, the factor (N + 8) 
is the exact group theory factor. However, in the naive large A" approximation, we 
should use A" instead of (N + 8). When we apply this to the 0(4) model, this makes 



Phase 



l + 4Ao(AT + 8)/(0)' 



7(0) 




(3.33) 



54 

a factor of 3 difference. We therefore have to conclude that the leading order large 
N results are ambiguous when applied to N = 4. As we will see in the next example, 
similar situation occurs for the broken phase. We can calculate this renormalized 
coupling constant at the same correlation length £ = 1/m in the conventional O(N) 
model and in the Pauli-Villars theory, with some fixed value of M, for every value of 
the bare coupling constant A . The magnitude of this quantity reflects the strength 

5 
4 
3 
2 
1 


0.2 0.4 0.6 0.8 1 
X /(10+X ) 

Figure 3.6: The comparison of the large N renormalized coupling constant in the 
symmetric phase is shown for three cases: continuum Pauli-Villars, lattice Pauli-Vil- 
lars and the conventional O(N) model. For this choice of the correlation length, the 
lattice effects are small and the Pauli-Villars theory shows much stronger interaction 
when compared with the conventional O(N) model. We have modified the naive 
large N formula so that the right group theory factor is substituted, i.e. N + 8 = 12. 

of the interaction in the symmetric phase. In Figure (3.6) , the comparison between 
the two theories is shown for m = 0.3, M = 1.0 for every given A . The lattice 
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summation is calculated on a 32 4 lattice with the naive lattice discretization of the 
Laplacian. The continuum Pauli-Villars result is also shown in the figure. For this 
choice of the correlation length, the lattice effects are rather small in both theories. 
It is clear that the Pauli-Villars theory has stronger interaction, about a factor of 4, 
for the same correlation length when compared with the conventional O(N) model. 
As mentioned above, the large N expansion has its own ambiguities, so we do not 
anticipate this large N result to give us a precise quantitative description of the 
theory. However, we do expect that the increase of the coupling constant in the 
Pauli-Villars theory relative to the conventional theory should also be present in the 
full theory. Similar results in the broken phase also support this picture, as we will 
see in the next subsection. 



3.2.3 Higgs Mass and Width for Conventional O(N) Model 



Now, we will examine the situation in the broken phase of the theory. We 
will first briefly review the large N result for the conventional O(N) model with 
a hypercubic lattice regulator [9, 10] in the broken phase. The large N Goldstone 
propagator will remain the free propagator in the leading order of 1/7V expansion, 
but the large N Higgs propagator will be 



In general, the lattice bubble is a very complicated function of p. However in the 
limit of small p 2 , which is the regime of physical interests, it has been worked out 



r CT .(p 2 ) 




8X v 2 



l+4A V5(p 2 )' 




(3.34) 
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and has the following asymptotic expression [8] 



B{p 2 ) 



1 



(-\ og (p 2 ) + c latt + 0(p 2 )), 



(3.35) 



16tt 2 



where the constant ci aU = 5.79200957 for the hypercubic lattice. We can now use 
this relation to find the complex pole to the Higgs propagator. Since the higher order 
terms are neglected in the bubble, it is sufficient to keep only the leading term in the 
lattice momentum. Setting (— p 2 ) = s = [m H — iY H /2) 2 = r 2 e~ 2td , we have (taking 
the second Rieman sheet value for the logarithm) 



The phase 9 is first determined from the second equation and then substituted into 
the first one to get the real and imaginary part of the Higgs pole. The result is 
summarized in Figure (3.7) . For the Higgs correlation length of about 2, the ratio 
m H /v is only about 3. The correlation length 2 is chosen because if the Higgs is 
too heavy in lattice units, the lattice effects would become significant and the theory 
would no longer describe continuum physics [8]. 

Another feature that we can study is the width of the Higgs particle. There 
has been quite a lot of confusion even with the conventional O(N) model. The large 
iV width of the O(N) model was first carefully studied by Einhorn using a sharp 
momentum cutoff [10]. He found that the large iV formula, if N = 4 is substituted 
in, gives too large a width (40 percent larger) when compared with the perturbation 
theory of the 0(N = 4) model. Therefore, the large N results seem to indicate 
that the theory is more strongly interacting than the perturbative predictions. Some 
authors interpret this finding as genuine nonperturbative effects of the model [9]. 
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Figure 3.7: The large N result of the Higgs mass over vev ratio mn/v as a function 
of the bare coupling constant for the conventional O(N) model with a hypercubic 
lattice regulator. Four curves correspond to different v values (in lattice units) as 
indicated. N has been set to 4 in the calculation. 

However, we do not think this is true for the following two reasons. First of all, 
the large N result of the width does not agree with perturbation theory, even for 
very weak couplings. In this regime, the next to leading term of the width has been 
calculated in perturbation theory. The correction is very small, typically of the order 
of one percent. Therefore, it is unlikely that even higher order terms will change this 
perturbative result significantly. Secondly, the perturbative result has been proven 
to be correct by extensive nonperturbative Monte Carlo simulation studies. No 
mysterious nonperturbative effects as predicted by large N have been found. 

We believe this discrepancy is because of the ambiguity within the large N 
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framework. The large N result can only be accurate to about 20 to 30 percent 
because of the large subleading 1/N terms at iV = 4. After all, iV = 4 is too far 
from N = oo. This has been previously pointed out by Kuti, et. al. [11]. More 
importantly, we have over estimated the decay channel of the Higgs particle in the 
naive large N formula. At N — 4, the Higgs particle can decay into 3 colors of 
the Goldstone pairs while the large N formula counts 4. When we take this into 
account and substitute (N — 1) for N in the large iV formula, we expect compatible 
results with the perturbation theory. In Figure (3.8), we have plotted the two large 




m H (TeV) 

Figure 3.8: The large N results for the width of the Higgs particle as a function of 
the Higgs mass is shown. The open squares are the naive large N prediction for 0(4) 
model. The open hexagons are the large N results after the number of decay channels 
has been corrected. The solid line is the leading order perturbation result and the 
dashed line is the perturbation result up to the second order. The corrected large 
N width agrees with the perturbative prediction very well in the weakly interacting 
regime as it should. The naive large N result overshoots by about 30 to 40 percent. 
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N results of the width as a function of the Higgs mass and compared them with the 
perturbative results. As expected, the corrected large N width agrees reasonably well 
with the perturbative predictions, especially in the weakly interacting region or small 
Higgs mass. The naive large N result, however, overshoots by about 30 to 40 percent 
simply because it fails to identify the correct number of decay channels of the Higgs 
particle. From this calculation, we conclude that the large N approximation has its 
own ambiguities when it is applied to N values that are not very large. Therefore, 
one must modify the naive large N formula in order to get meaningful quantitative 
results. 

3.2.4 Higgs Mass and Width for Higher Derivative O(N) 
Model 

In higher derivative theory, things are getting more complex because of the 
ghost pair. One could try to evaluate the continuum bubble integral in Equa- 
tion (3.29) and solve for the complex pole of the Higgs propagator. Note that this 
integral is finite and no regulator has to be introduced. The precise form of the bub- 
ble integral is very complicated and is listed in the Appendix of this chapter. The 
analytic structure ( Riemann sheets and cuts ) are also quite complex, as described in 
the Appendix. The resulting function is then substituted into the full large N Higgs 
propagator to solve for the poles. The complex pole structure of the function is also 
very complicated due to the existence of the ghost states. The poles are numerically 
searched for a given parameter v in M unit and a fixed value of the bare coupling 
constant A . The result is shown in Figure (3.9). One has to be careful with the Rie- 
mann sheet structure of the function in order to get the right result. The poles are 
characterized by their positions on the Riemann sheets. On the first Riemann sheet, 
due to the ghost states, one finds a conjugate pair of poles represented by the open 
hexagonal points in the figure. They are moving towards the higher energy values 
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Figure 3.9: The complex poles of the large iV Higgs propagator is shown on the 
first and the second Riemann sheets. The bare coupling constant is set to infinity 
in this figure. The open hexagonal points represent the ghost pair poles on the first 
Riemann sheet. The filled hexagonal points are the 'image' of the ghost on the second 
Riemann sheet. The filled circles are the Higgs poles on the second sheet. The size 
of the points reflects the different v values. 

as the interaction is turned on. These complex conjugate ghost pairs have "shadow 
images" on the second Riemann sheet which are represented by the filled hexagonal 
points. Because of the interaction with the Higgs pole on the second sheet, these 
poles are not moving symmetrically. The conventional Higgs poles are on the second 
Riemann sheet, represented by the filled circles. As the vacuum expectation value 
is increases in M unit, the Higgs pole is moving towards the higher energy range. 
When the Higgs pole is at very low energy and far away from the ghost poles, the 
effects of the ghost states can be viewed as an effective cutoff to the conventional 
theory. This can also be seen in Equation (3.29). When p 2 is small, the bubble 
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integral becomes very simple and can be very well approximated by 

1 



B(p 2 ) 



16tt 2 



log(p 2 ) + l/2). 



(3.37) 



But, if the Higgs pole is getting closer to the energy scale of the ghost poles, the 
higher derivative theory feature has great importance and viewing the ghosts as the 
effective cutoff to the conventional theory becomes meaningless. 

Identifying the real part of the Higgs pole with the mass parameter and 
the imaginary part with the half width, we can plot the ratio m H /v as a function 
of the bare coupling constant, which is shown in Figure (3.10). In this figure, we 
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Figure 3.10: The large N result of the ratio m H /v as a function of the bare coupling 
constant for various values of the vacuum expectation value (measured in M units) 
for the higher derivative O(N) theory. The maximum ratio saturates to about 4 at 
infinite bare coupling constant. 
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have selected 4 different vev values and the ratio saturates to about 4 when the 
bare coupling constant is brought to infinity. When we set the physical value of the 
vacuum expectation value to 250 GeV, this implies a Higgs particle with the mass 
m H = 1 TeV. This should be compared with the result of the conventional 0{N) 
model discussed earlier in Figure (3.7). Although the absolute values of the Higgs 
mass may be somewhat ambiguous due to the large N approximation, this result 
indicates there is a 30 percent relative increase in the Higgs mass over vev ratio 
when the Pauli-Villars theory is compared with the conventional O(N) model on 
the hypercubic lattice. So we would expect the full Pauli-Villars theory should also 
generate a larger mu / v ratio compared with the conventional theory. Recall that the 
Higgs mass bound for the conventional theory is about 750 GeV (which is a ratio of 
3), we expect the Pauli-Villars theory could have a heavy Higgs particle in the TeV 
range. In fact, this hint from the large N expansion initiated our nonperturbative 
study of the Pauli-Villars theory [6]. As we will see in the coming chapters, this 
scenario of strongly interacting Higgs sector in the Pauli-Villars theory is confirmed 
by our nonperturbative simulation results. 

We can plot the width of the Higgs particle as a function of the Higgs mass, 
just as we did for the conventional theory. In Figure (3.11), the similar plot for 
the higher derivative Pauli-Villars theory is shown. One has to again modify the 
naive large N results for the right decay channels. At very low energy, this result 
agrees with the perturbative result, which means that the theory can be viewed as a 
Pauli-Villars regulated conventional theory. However, when the Higgs mass is getting 
heavier, it deviates quite rapidly from the perturbative result, and in this range, the 
higher derivative theory does not resemble a regulated conventional theory. 
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Figure 3.11: The large N result for the width of the Higgs particle as a function 
of the Higgs mass is shown in the Pauli-Villars higher derivative O(N) theory. The 
open squares are the naive large N prediction at N = 4. The open hexagons are the 
large N results after the number of decay channels has been corrected. The solid line 
is the leading order perturbation result and the dashed line is the perturbation result 
up to the second order. The corrected large N width agrees with the perturbative 
prediction very well in the weakly interacting regime as it should. The naive large 
N result overshoots by about 30 to 40 percent. 

3.3 Perturbation Theory of the Higher Derivative 
0(N) Model 

The renormalized perturbation theory of the higher derivative scalar O(N) 
model can be established in the usual way, except we pay special attention to the role 
of the ghost pair. In the low energy regime, we expect to recover the conventional 
theory. When the energy scale is increased, one should see the ghost pair begins to 
play a more important role. To incorporate this energy dependence, a mass depen- 
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dent renormalization scheme is needed. We now illustrate this briefly by considering 
the model in the broken phase. 

3.3.1 Lagrangian and Renormalization Conditions 

Let us consider the following Euclidean Lagrangian, 

Ce = ^ a (-n-^)0 a -^V0 a + A(0 a a ) 2 , 

Ft 7 F7 D 3 Fu 2 

sc E = Yr(-o)r + ^n- W4 )r-^rr + sx(rr) 2 - (3.38) 

We can define the bare fields and bare parameters according to 

<Pl = z{ /2 <t> a , Z 1 = 1 + 5Z 1 , 



M- 4 = |^M- 4 , Z 3 = 1 + 5Z 3 , 



ii\ = (/i 2 + 5 / i 2 )/Z 1 , 

A = (\ + 5X)/Zl (3.39) 
Then, the total Lagrangian can be written as 

£e = ^(-D - ^Wo - + Ao(0^) 2 - (3-40) 
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In the broken phase, it is convenient to separate the Higgs field and the Goldstone 
fields as 

/ \ 



(3.41) 



n N -\x) 
v + a(x) 

where v = /i 2 /4A is the renormalized vev. We can then write down various propa- 
gators to one loop order and impose the following mass dependent renormalization 
conditions, 



V 



Sv 



o, 



— F"V)l^o = 1, 



(3.42) 



Notice that the above renormalization conditions uniquely determines the four renor- 
malized parameters. The arbitrary scale k is introduced to avoid the infrared diver- 
gences. All the renormalized parameters will depend on this running scale through 
the above definitions. It is easy to fix the counter terms according to the above 
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equations, 

SZ 1 = Am 2 ^(0), 



M~ 4 5Z 3 = ^BZ(M% 



^ = 36fi CTCT (/t 2 )+4(iV-l) J B_(/ t 2 ), 
5m 2 5X T u iV-ir ,„ 

" " = T + ^ + — rf' (3 - 43) 



where the bubble integrals B aa ,B an , ,B n7T and the tadpoles T a ,T n are listed below: 

d A k 1 



B aa 



« = / 



B a7T {p 2 ) 



(2tt) 4 (k 2 + m 2 + k 6 /M 4 )((k - p) 2 + m 2 + (k- pf/M A ) ' 

d 4 fc 1 

(2tt) 4 (fc 2 + m 2 + k 4 /M 4 )((k - p) 2 + [k - p) 6 /M 4 ) ' 

rf 4 A; 1 

(2tt) 4 (P + k A /M A )((k - p) 2 + (fc - p) 6 /M 4 ) ' 

d 4 £; 1 



f dk 

J (2vr) 4 A; 2 + A; 4 /M 4 + m 2 ' 

7 (2tt) 4 A; 2 + A; 4 /M 4 ' ( ' 

3.3.2 One-loop Mass-dependent Beta-functions 



Now we can work out the one loop mass dependent /5-function of the theory, 
which is obtained by noticing that the bare coupling constant A does not depend 
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on the renormalization scale k. The result can be written in the following form, 
l a 9A / ^ i x(1-x)k 2 



A 27r 2 \j~f jo x(l — rr)/t 2 + a;Aj + (1 — x)Xj 

N ~ 1 4^ uu f\ x(1-x)k 2 \ 

+ ? hbj / dx— - 1 — ; — , (3.45) 

9 .^o 3 Jo x(l - x)k 2 + x& + (1 - x)Sj) y ! 

where k 2 = k 2 /M 2 and the mass parameters Aj, ^ and their corresponding residues 
are determined from the following decomposition: 

1 111 

+ 



k 2 + k 6 + (m/M) 2 k 2 + \ k 2 + A x fc 2 + A 2 ' 

The important feature of these coeffecients is 

^^ = ^ = 0. (3.47) 

i=0 i=0 

Now it is easy to see how the effective coupling constant evolve with the energy 
scale k. When k/M is very small, the ghost pair contributions to the beta-function 
is negligible. The summation in the beta-function reduces to only the i — j — 
contribution, which is the conventional, well-known beta-function of the O(N) 
model in the broken phase. As k/M increases, the ghost contributions become 
increasingly important. When the energy scale is well above the ghost scale, the 
integral in the beta-function reduces to 1 and the quantity in the bracket vanishes 
due to Equation (3.47). This means that, at high energies, beta-function of the theory 
vanishes. Therefore, as the energy scale increases, the running coupling constant \(k) 
also increases. However, at the scale of the ghost pair or higher, the coupling constant 
gradually flattens out to some finite number. This is a very different feature when 
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compared with the conventional O(N) model. In the conventional O(N) model, the 
running coupling constant keeps increasing and becomes divergent at the so-called 
Landau ghost energy scale. In our higher derivative theory, we have replaced the 
Landau ghost with real ghost pair and the running coupling constant will remain 
finite for all energies. 

3.4 Appendix 

In this appendix we list the explicit form of the bubble integral and discuss 
some analytic properties of such. 

The function is given by the parametric integral representation 

— 1 r 1 

B(s) = ~ V" CiCj I dx\og[xm 2 + (1 — x)m 2 — x(l — x)s], (3.48) 

16~- ^ Jo 

where the sum over i and j runs from to 2 with the following values of q and m 2 

ml = 0, c = 1, (3.49) 



2iB 

— to 



,2 _ / • 2/H /( 



2 sin 20 



_|_ 7 >+2»© 

mi = e 2i °, c 2 



2 sin 26 

The integral can be worked out explicitly with the result 



1 - s/AM 2 ) (s/AM 2 ) Jl - s/AM 2 + J-s/AM 



+ 2i0 " n, U K AK l0 §( 



12 



\2 



2(s/AM 2 ) yl - s/AM 2 - js/AM* 
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+ 4 



;s;h v "I - s/AM 2 ){-s/AM 2 ) y/l - s/AM 2 + /s/AM 2 )2 
2(s/47W 2 ) y/l - s/AM 2 - ^j-s/AM 2 



M 2 s 

+ 2c oCl [+2iB + (1 - —) log(l - j^) ] (3.50) 



+ 2c c 2 [-2i0 + (1 ) log(l - =^r) ] + 2c lC2 f(s) } , 

s M 

where the last term is the ghost-antighost contribution and the function f(s) is given 
by 

isin29 / ,s-2cos20 + A(s). , . -2M 2 A 

f(s)= ^( l0g( ~M )+log( g -2cos29 + A( g ) ) j 



A(s) / / s-2cos26- A(s), , .s - 2 cos20 - A(s) x \ 

^l log( ) -' og( >] ' 



A(s) = v /(s-4cos 2 0)(s + 4sin 2 0). (3.51) 

The logarithm functions in the above equations take the complex angle between n 
and — 7r. The function /(s) was worked out long ago by Lee and Wick but our 
results is different from theirs [2]. Their results correspond to combining the two 
logarithms in the above equation, which is not always legitimate because of the 
restricted range of the complex phase of the arguments under the logarithms. The 
Rieman sheet structure of this function is highly nontrivial as shown in Figure (3.12). 
Our function agrees with Lee's function when Re(s) < 2 cos 20. The function /(s) 
has a cut which is a hyperbola whose center is at (2 cos 20, 0) in the complex s plane. 
The function f(s) has a finite jump anywhere across the cut except at s = 4 cos 20 
where it is continuous. This function is analytic everywhere else away from the cut. 
The so-called "ghost-antighost threshold" is not a real one, and no imaginary part 
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Figure 3.12: The Rieman sheet structure of the function B(s) is shown. The filled 
squares represent the starting points of different cuts. The hyperbola cut is due to 
the ghost-antighost contribution f(s). 

contribution will arise when the center of mass energy steps through 4 cos 2 0. This 
is necessary for the unitarity to hold. Other parts in the B(s) function have cuts 
starting at the ghost pole location. In the small s and large s region the function 
B(s) simplifies to 



B(s) 



16tt 2 



(log(s)-i7r + l/2 + 0(s)), 



Bis) ™ 



16tt 2 



'-m + 0(l/s)). 



(3.52) 



This concludes our discussion of the analytic properties of the function. 
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Chapter 4 

Higher Derivative Field Theories 
on the Lattice 

4.1 The Naive Lattice Action and Phase Diagram 

The need of a lattice for the higher derivative scalar field theory presented 
in the previous chapter is not for the purpose of regularization, but rather, to make 
the degree of freedom finite so that a nonperturbative study of the model can be 
performed in computer simulations. The lattice spacing a introduces a new short 
distance energy scale with the associated momentum cutoff A = ir/a. In order to 
recover the higher derivative field theory in the continuum, we would have to work 
towards the A/M — > oo limit with a fixed ratio of Mjmu- The lattice action we 
choose to study [1, 2] is 

C E = -Kj>(x)(-n - ^)4>{x) + (1 - 8K)<j>(x) 2 - X^xf - l) 2 , (4.1) 

where the □ is the lattice Laplace operator. The phase structure of this lattice model 
is quite similar to the conventional 0(N) scalar field theory. It has two phases as 
shown in Figure (4.1). The O(N) symmetric phase is separated from the broken 
phase with residual 0(N — 1) symmetry by a second order phase transition line for 
every value of the lattice coupling constant A in the (k, M) plane. Near the critical 
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Figure 4.1: The phase diagram of the lattice model at infinite bare coupling. Data 
points are obtained from Monte Carlo simulations. The dotted line is calculated in 
the large-N expansion. The solid line displays a fixed Mr/tjih ratio towards the 
continuum limit of the higher derivative theory. 

line, we expect to recover the continuum theory without the lattice artifacts. How- 
ever, the critical behavior of our model is more complicated than the conventional 
O(N) model. It can represent different universal continuum theories along different 
paths towards the critical line. Tuning the value of k towards the critical line for 
any fixed value of M corresponds to the trivial field theory in the continuum. In 
this limit, the operator 0D 3 becomes irrelevant in the critical region. However, if 
we tune the value of k towards the critical line in such a way that the ratio Mr/ run 
remains fixed, we will recover the continuum higher derivative field theory with the 
corresponding ratio of the ghost mass parameter and the Higgs mass. In this limit 
the operator 0D 3 cannot be viewed as an irrelevant operator [4] in the Lagrangian. 
Thus, it becomes clear, from the discussion above, that if we want to study the higher 
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derivative field theory, we have to work towards the second limit. 

In the practical application, however, this limit is not very easy to arrange. 
One reason is that if we want our results to represent the continuum results, we have 
to keep the ghost mass parameter M reasonably small in lattice units in order to get 
rid of the lattice effects associated with it. On the other hand, we need to put the 
Higgs mass below the ghost mass parameter. Therefore, we are very restricted in 
the parameter space. On the one hand, making the Higgs mass smaller will lead to 
huge finite size effects for the practical lattice sizes; on the other hand, making the 
Higgs mass larger will push up the ghost mass and will result in large lattice effects. 
So we have a rather narrow range in the Pauli-Villars correlation length M/tuh- 
Typical values we took in the beginning of our simulation were: M = 0.8 ~ 1.0, 
m = 0.3 ~ 0.4. This, of course, was unsatisfactory because the ratio M/m H = 2 ~ 3 
is too narrow of a range. If we view this theory as a Pauli-villars regulated theory, for 
example, we would hope to see the conventional scaling behavior in the large M/rnn 
limit. It turns out that the scaling form may apply only for rather large M/rnn values 
which is impossible for us to investigate using this naive lattice action. Also, due to 
this restricted range, it was also impossible for us to study the scattering phase shift 
profile of the model. This type of analysis offers us a very good way of extracting 
the mass value for an unstable particle in the finite box ( see Chapter (6) for full 
discussion ). This restriction in the parameters is purely due to the introduction 
of the underlying lattice structure. When we were able to eliminate most of the 
lattice effects, we were then able to enlarge our parameter space quite substantially. 
Therefore, the need for an improved lattice action becomes quite obvious. 
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4.2 The Improved Lattice Action 

Improving the lattice action so that it has better Euclidean invariance was 
studied long ago [3]. Our choice of the improvement corresponds to modifying the 
lattice Laplacian so that it resembles the continuum Laplacian. Therefore, we take 

P 2 I = P 2 + a 1 Y,K + ^T,pt + ^T,pl + ^ + «6 E^i 2 + «« E^ 4 > (4-2) 

fi fi /j, fi fi /j, 



where the coefficients are given by the following table 
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In fact, we calculated the renormalized coupling constant in the large N limit 
and we found that this improved action significantly decreased the lattice effects. 
With this improved lattice action, even at M — 2.0, there was negligible lattice 
effects on the large N results. The phase diagram of the improved action is similar 
to the naive action. 

The improved action offers us another power: the possibility of performing 
a phase shift simulation on the higher derivative O(N) model. This is the subject 
in Chapter (6). As we will demonstrate, without the improved action, we are in the 
parameter range that is impossible for this type of simulation because we would need 
unrealisticly large lattices to extract the phase shift. With the improved action, this 
type of calculation becomes possible. 
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4.3 The Rotator States and Born Oppenheimer 
Approximation 

Studying the higher derivative O(N) model in the broken phase and the 
corresponding Higgs mass problem requires a better understanding of the symmetry 
breaking mechanism in the finite volume. In fact this is already an important issue 
in the conventional O(N) theory without the higher derivative terms added. The 
symmetry breaking mechanism has been understood very well in the infinite volume 
limit. However, it has not been answered satisfactorily in the O(N) model in a finite 
volume. 

There are several complications. First of all, the notion of symmetry break- 
ing in the infinite volume cannot be applied to a system in a finite volume. Strictly 
speaking, in a finite volume, the symmetry is never broken. Secondly, it turns out 
that the dynamics of the zeromode are crucial for the understanding, and the zero- 
mode is coupled to other modes in a complicated way. For the one component 4 
theory, Hartree type of approximation will give us a very good description of the 
symmetry breaking. For the O(N) model, extra care must be paid to the motion of 
the zeromode and new approximation schemes are needed for the understanding of 
the problem. This section consists of several parts. In the first part, we will review 
what is known to the symmetry breaking in a ordinary one-component 4 theory in 
the broken phase. It turns out that this is a very instructive model to study. In the 
second part, the conventional O(N) model is studied in the broken phase. Here we 
introduce the Born- Oppenheimer Approximation (or Adiabatic Approximation) and 
fully investigate the dynamics of the zeromode. In the third part, we consider some 
important applications of the Born- Oppenheimer Approximation. The machinery is 
applied to the ground state and higher energy excited states. The rotator correction 
to the energy of these states is calculated. This will serve as a theoretical guide line 
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to the analysis of the simulation results in Chapter (5). Then, the higher derivative 
O(N) theory is presented in the next section. 

4.3.1 Symmetry Breaking of the One-component 4 Model 

Consider the one component 4 theory in a cubic box. The Hamiltonian of 
the theory is given by 



This Hamiltonian is obviously invariant under the change — > —0. That is to say 
that one can construct a parity operator P, which flips the sign of the field and it 
commutes with the Hamiltonian. Therefore, all the eigenstates of the Hamiltonian 
can be chosen to have a definite parity. 

We can build up two approximate ground states of the Hamiltonian |±), 
which are Gaussian wavefunctions centered at ±v respectively. This picture is very 
well illustrated by the so-called "Hartree approximation" . We start with a trial wave 
functional which is a Gaussian 



where v,G _1 (x, y) are variational parameters and the summation over x and y is 
implied. The approximate ground state of the system can be found by using the 
minimization condition of the energy. This condition in the broken phase will give 



us two solutions for the parameter v, namely, v = ±y//o/4Ao and the propagator 

G~ 1 {x,y) is given by 



(4.3) 




(4.4) 





(4.5) 
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If we denote these two states as |±) then we see they satisfy the following properties 

P|±) = | T ), (+|_) = e -«W 3 . (46) 

Note that the states |±) are not orthogonal to each other in the finite volume. 
The true ground state and the first excited state are given by the symmetric and 
antisymmetric linear combination of these two states 

|0> = ^(|+> + h», |l> = ^(|+>-|-». (4.7) 

The true ground state is a parity even state, while the first excited state is a parity 
odd state. The energy difference between the two is exponentially small when the 
volume is large. This means that if the system were started at one of the minimum, 
after a long enough time, there is a finite probability of finding the system tunneled 
to the other minimum. The typical time scale for this is 1/AE, where AE is the 
energy difference between the ground and the first excited state. 

If we use a infinite volume, the state |+) would be exactly orthogonal to the 
state |— ); then the system starting from one particular minimum of the potential as 
the true vacuum will stay there, without knowing the other one and the <fi — > — <fi 
symmetry is broken. However, in a finite but large volume, the system will stay 
around one minimum for such a long enough time that we may say the symmetry is 
"almost broken". 

In the one component model, the symmetry is a discrete symmetry and the 
Hartree approximation gives us a very good understanding of the symmetry breaking 
mechanism in the finite volume. Nonperturbative works have also been done to 
measure the energy gap between the ground state and the first excited state, which 
is related to the surface tension of the system. 
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4.3.2 0(N) Model: General Setup 

The situation is much more complicated when we try to do a similar anal- 
ysis for the O(N) model. The main reason is that the symmetry is a continuous 
symmetry, therefore, the dynamics of the zeromode is much more complicated. 

We could first try out the Hartree approximation, but it will not give us 
the right energy spectrum of the theory. This is because the Hartree approximation 
treats every mode of the system equally. In the one component model this is valid, 
but it is not valid for the O(N) model. In the O(N) model there exists one special 
mode, that is, the direction of the zero Fourier mode which can be characterized by 
an O(N) unit vector. In a large but finite volume, this mode is a slow varying mode 
when compared with the other modes. It is the counter part of the parity operator 
in the one component model. The only difference is that, in the one component 
model, the parity only takes discrete values and is not dynamical. In the O(N) 
model, however, this unit vector lives ona(#- l)-sphere and has its own dynamics. 
Therefore, we expect that the Born-Oppenheimer Approximation (BOA), also known 
as the Adiabatic Approximation, will give us a very good description of the zeromode 
dynamics. 

The Born-Oppenheimer Approximation was first introduced in the study of 
diatomic molecules. In the molecular problem, there are two types of degrees of 
freedom. The motion of the electron is called "fast" , and the motion of the nucleus 
is called "slow" . Therefore, when solving the energy eigenvalues of the system, one 
should first nail down the slow variable, namely the configuration of the nucleus, 
and solve the fast variable spectrum. In this step, the configuration of the slow 
variable is treated as an external field. The eigenvalues and eigenstates that come 
out will, in general, depend on the prescribed configuration of the slow variable. 
These eigenvalues are then taken back into the Schrodinger equation for the slow 
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variables as the effective potential, which reflects the feedback of the fast variable to 
the slow variable. Finally, the Schrodinger equation for the slow variable is solved 
to get the spectrum of the molecule. 

The spectrum of the molecule has a three fold hierarchy: electron energy, 
oscillation energy and rotational energy. These energy gaps are characterized by 
different powers of a small parameter, which is the ratio m e /m N , where m e is the 
mass of the electron and m N is the mass of the nucleus. Born-Oppenheimer is a 
very good approximation for the molecule, since this ratio is so small. It is not hard 
to imagine that a Hartree approximation to the molecular problem would be a poor 
choice, since it treats the electron (fast variable) and the nucleus (slow variable) 
equally, while ignoring the enormous difference in the mass of the two. Similarly, in 
our application, Hartree is a poor approximation for the same reason. To get the right 
picture, one has to separate the special zeromode and use the Born-Oppenheimer 
type of approximation. 

In our model, we will treat the direction of the zeromode as the only slow 
variable. We will use the same Born-Oppenheimer type of spirit to solve for the 
energy levels of our model. 

We begin with the Hamiltonian 



where for convenience we have discretize the system on a cubic lattice. The operator 
7r a (x) is just the derivative operator (—i)d/d(j) a (x.) in the field variable diagonal rep- 
resentation. Two classes of symmetry operators that commute with this Hamiltonian 
are very important. First, there are global O(N) symmetry generators Q ab given by 




(4.8) 



Q ab = £0 a (x)7r 6 (x)-0 6 (xK(x) 



(4.9) 



X 
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We also have the 3-momentum operators 

P t = ]T[0 a (x + e ; ) - a (x)]7r a (x), (4.10) 

x 

where e ; is the unit vector in the i direction. It is trivial to verify that these operators 
commute with the full Hamiltonian hence are symmetries of the theory. 
We now introduce the Fourier modes of the field variable, 

a (x) = F + E ^ k ' x + Ce" lk - X , 
v V k>0 

= H) 9 + Hy -ik.xj_ + +ik-x^_ / 411 n 

where V = L 3 is the 3-volume of the box. As we mentioned above, the zero mode 
(f) a plays a very important role in the broken phase. Therefore we have singled out 
this mode from the nonzero momentum modes. Let us define: 

(f) a = ( v + cr)n a , n a n a = 1, 



pf = n a n\ P« h = 5 ab -n a n\ (4.12) 

where v = /!q/4A is the vev of the theory. In the Born-Oppenheimer type of ap- 
proach, we will treat the direction of 4> a , namely n a , as the only slow varying variable 
and treat the rest as fast variables. The justification of this will be seen shortly. Then 
the Hamiltonian can be expressed in terms of these Fourier modes. 

We use the radial variables for the mode <p a . Thus we write the wavefunctional 
of the system as ^ = p~( iV ~ 1 )/ 2 -?/> and the effective Hamiltonian for ip will contain 
only the second derivative with respect to p. For the nonzero Fourier modes, let us 
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introduce the creation and annihilation operators as 



■at 



1 ^(v^c - -4^), 



\J A V ii, k L/ rk 



14 90,* 1 ' 



(4.13) 



where f2 k = \Jitiq + k 2 is the higgs excitation. We can define the Higgs creation and 
annihilation operators as 

1 



K = n a Ll hi = n a Ll\ a 



--(h + hl). 



(4.14) 



Similarly, we can define the transverse Goldstone creation and annihilation operators 



as 



= -^p?(V^K* 



i a 

u\d(f)' 



»)■ 



6* )' 



(4.15) 



where u; k = |k| is the Goldstone energy. In terms of these operators, we can rewrite 
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the effective Hamiltonian in the following form 

L 2 + A N 



^ 2V(v + *)>- 



H int = E4A o ^(/i 2 + 0i0i) + A o (/i 2 + 0l0i) 2 , (4.16) 

x 

where the fields h(x) and are given by 

Mx) = E^^ + ^e— ), 

0l(x) = £^^(T k V k ' x + T k a V* k - x ), (417) 

and the constant A N = (N — 3)(iV — l)/4. The operator L 2 = is the O(JV) 

Casimir of the zeromode variable. The above creation and annihilation operators 
satisfy the following commutation relations 

Pk, = ^T^kp, [K, L b J] = Pf5 kp . (4.18) 

It is also very convenient to introduce the following decomposition for the fields. 
For a given O(N) unit vector n a , we can find additional N — 1 unit vectors which, 
together with n a , form a complete set in the O(N) space. We therefore define 

n a = n a , nln a p = S aP , a, (3 = 0, 1, ...N - 1, 
T k a = <T ik , T, k = <T k a , 

^k = ng/i k , /i k = ngL£. (4.19) 
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It is readily checked that these operators satisfy the standard commutation relations 

H, hi] = 5 kp , [T ik , Tjp] = S^. (4.20) 

Note that due to the leftover 0(N — 1) symmetry, the determination of the unit 
vectors is not unique. However, the physical quantities will not depend on this 
ambiguity. Moreover we can calculate the commutator of the operator Lg 6 with the 
unit vectors, 

[L*, <] = (-i)n^ c , a = 0, 1, ...N - 1. (4.21) 

Now we can set up a basis in our Hilbert space from the eigenstate of the free Hamil- 
tonian. We will also choose the angular momentum eigenstate of the n a variable, 
namely 

k {K, n l}, lm ) = \ n ) ® n l n k) ® \ n l) ® \ lm ), ( 4 - 22 ) 

where the state \lm) is the eigenstate of L 2 with eigenvalue 1(1 + N — 1). This is just 
a symbolic notation of the state. Strictly speaking, for O(N) model, we need more 
quantum numbers to specify the state. Note that the oscillator part of the state 
actually depends on the unit vector n a via the definition of the longitudinal and 
transverse projection, although the eigenvalue does not. Therefore, if we were to act 
the operator L 2 on the states above, it would not only act on the state \lm), but also 
act on the rest of the components ( except \n), of course, since it is the radial zero 
momentum mode ). As we will see below, the Born-Oppenheimer Approximation 
will first neglect the effect of L 2 on the fast modes and only consider the slow mode 
part of the state, i.e. \lm). The next order correction has to take this into account 
and the BOA is valid when the correction is small. 

The global O(N) generator Q ab now can be expressed in terms of the creation 
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and annihilation operators defined in Equation (4.19) 



Q ab • = Lt '- iE4Vl°4 (4-23) 

where the index a and f3 run from to (N — 1). The operators T ok and the functions 
/k, 9k are defined by the following: 

^Ok = /k^k ~~ <?k^-k, ^Ok = /k^k — <?k^_k> 

/ k = I(/5 + /^L) ^ k = I(/5_ /^) (424) 

2 y CJk V ^k 2 y Wk V ^k 

We will need the result of L 2 acting on the oscillator states. Let us consider the 
object L 2 |Ok^o)- This can be easily obtained by noticing that the state is annihilated 
by the global O(N) generators Q ab . Therefore we would induce that 

K^o) = E i9*n [ on%l k Tl\0^ ). (4.25) 

k^O 

We will also need the commutation relations between and the annihilation oper- 
ators 

[L% b ,L k ] = (-t)(nH b ^h k + n%f k n^+g k n^T b Jl)), 

[<\T k c ] = H)(nH b Hhh k -g k hl k )+n c n^), (4.26) 
or equivalently, in terms of operators T ik and h k , we have 
[Okl = HWonf(hT zk + g k Tl k ), 

[Lf,T ik ] = (-O^'njT^ + C-OnfnJC/k/ik-flt/iU). (4.27) 
The corresponding commutators with the creation operators can be obtained from 
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hermitian conjugation of the above equations. We now have all the tools to study 
the rotator energy spectrum. 

4.3.3 0{N) Model: The Ground States 

The full ground state of the free Hamiltonian consists of two parts. One is 
the ground state of oscillator states. The other part is the rotator states. 

\G,lm) = \0 OSC ) <g> \lm). (4.28) 

Note that, before the rotator energy contributions are taken into account, the de- 
generacy of the ground state is infinite, since any rotator state \lm) will belong to 
the same energy. It is very easy to check that all these states are eigenstates of the 
momentum operator with eigenvalue of 0. They can also be taken as the eigenstates 
of the appropriate 0(N) operators. To see this, notice that when the operator Q ab 
is applied to the states, it generally has two contributions. One is from Q ab acting 
on the oscillator state |0 OSC ), which is zero in this case; the other is from Q ab acting 
on the rotator states \lm), which is equivalent to L^\lm). Therefore, we have 

Q ab \G, Im) = \0 OSC ) <g> L ab \lm). (4.29) 

By taking the states \lm) to be the eigenstates of the zeromode Casimir, we also make 
the ground states to have the appropriate O(N) charges. Basically, the oscillator 
ground state has O(N) charge 0, and all the O(N) charges comes from the rotator 
states. 

As mentioned above, the leading order ground states are infinitely degenerate 
due to the rotator states. This degeneracy is lifted once the first nonvanishing rotator 
correction is taken into account. To do this, let us evaluate the matrix element of 
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the rotator energy operator (l/2)LQ b L[f w r among the ground states 
L% b \0 osc )®\lm) = (L% b \0 osc )) <g> \lm) + |0 OSC ) <g> (Lg*|Zm)) 



= £ ^knM/il^lO^e) <g> |/m) + |0 OSC ) ® «|/m)), (4.30) 
where we have used Equation (4.25). Therefore we get 



(G,l'm'\l/2L ab L ab uj r \G,lm) 



1(1 + N- 2)u r + (N- l)u r gl 



Sll'Smm'- (4-31) 



As expected, the degeneracy is lifted, and the ground states now have a degeneracy 
of I 2 , for a given / value. The second term in the above equation is an I independent 
constant and can be absorbed into the definition of the ground state energy. The 
first term is / dependent and is known as the rotator energy spectrum. This formula 
was derived before by Leutwyler using the rigid rotator approximation in the chiral 
Lagrangian formalism. The significance of this energy spectrum in the Monte Carlo 
simulation was also discussed [5]. The low energy excitations of the model exhibit 
a three hierarchy of energy gaps. The largest energy gap is the mass gap of the 
Higgs particle, whose energy is independent of the 3-volume. The second largest gap 
is the Goldstone particle, whose energy gap is typically of order 0(1/ L), where L 
is the size of the 3 dimensional cubic box. The smallest gap is the rotator energy 
differences between different / values, whose energy is of order O ( 1 / L 3 ) . In a practical 
simulation, the size of the Higgs gap and the Goldstone energy gap are of the same 
order, since the size of the box is not large enough. However, the energy gap of the 
rotator is usually much smaller compared with the Higgs and Goldstone. Therefore 
we expect the Born-Oppenheimer picture should be a very good description of the 
theory in the finite box. 

It is also possible to evaluate the second order correction to the ground state 



88 



energy. Let us first look at the following quantity 
L ab L ab \G,lm) = {L ab L ab \U osc ))®\lm) 

+ 2«|0 OSC » ® (K b \lm)) + |0 OSC ) <g> («|/m». ( 432 ) 

The first term in the above equation will contribute at the second order but it is 
a term independent of I. If we only focus on the /-dependent terms, we can forget 
about this term. The last term is diagonal, it will not contribute to the second order 
correction of the ground state energy. Therefore, only the second term will give us 
/-dependent contribution to the ground state energy. The state left over is 

2(^ 6 |0 OSC » ® W|Zm» = 2 ]T ig v ntn%{Tl^ osc )®(Lf\lm)). (4.33) 

k^0,a,6,i 

Therefore, it will contribute a second order energy correction that looks like 

4 2) = - E ^^{lm\ltfrk%H a r$L*\lm). (4.34) 

The matrix element that appears in the above equation can be simplified by noticing 

Ltfn^nMnUo = ^ b n b n c L a c . (4.35) 

We can pick our O(N) axis such that the unit vector is in the direction (0, 0, • • • , 1). 
Then the above operator simplifies to the difference of two Casimirs: the Casimir of 
the O(N) and the Casimir of the unbroken 0(N — 1) 

4? = "E + N-2)- {Im^l^lm)) . (4.36) 

This correction is usually very small for practical simulation parameters. However, 
there is another second order correction of the rotator Hamiltonian. Recall that we 
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can expand the rotator Hamiltonian into the form 



L 2 + A N L 2 + A N/ _ 2a 3a 2 



^_ = - - — + — + ...). (4.37) 

2V{v + af 2Vv 2 1 v v 2 ' K ' 

Note that the operator a — (h + hl)/y/2Vm . therefore we have a systematic 
expansion in inverse powers of the 3- volume. If we introduce the rotator energy oj t = 
l/2Vv 2 , then we have a expansion in terms of the small quantity uj r /m. Therefore, 
there is another contribution from the operator L 2 u r (a/v) 2 which is also of the order 
of us 2 . So we have 

E$ = [l(l + N-2) + A N ]u r ^ 



2< ) 2 n 2 

- T,-^(Kl + N-2)-(lm\L 2 0{N _ 1) \lm)). (4.38) 



The first term basically takes into account the nonrigid effects of the rotator. 

4.3.4 O(N) Model: The Zero Momentum Higgs States 

Let us consider the energy corrections to the state \n,Ok^o,lm). Since the 
zero momentum Higgs excitation is just the radial excitation which commute with 
the angular variables. Therefore the energy corrections to the state are very much 
like the corrections for the ground states. 

E<£> = (n+l/2)m, 

E$ = [l(l + N-2) + A N ]u; r , 

= (l(l + N-2) + A N )uj r (n + l/2) Qu: " 



m 
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- E^^(Kl + N-2)-(lm\Ll {N _ 1) \lm)). (4.39) 

k ^g u^k T S'k 

The second order correction is down by an extra factor of u r /m compared with the 
first order correction. However, due to the large numerical factor in front, the effect 
of the first term is still quite significant. In a practical simulation, the correlation 
function (n a (0)n a (r)) is measured and used as a way of extracting the vacuum ex- 
pectation value v. This correlation function will pick up the energy difference of 
Al = 1 states. In most of the old simulations on 0(4), the ratio uo r /m is very small 
and the rigid approximation of the rotator energy gives very reliable results. In our 
recent simulations on the higher derivative theories, this ratio is of the order of 10 
percent and the correction is noticeable. We would find the wrong v value if we did 
not include this correction. 

4.3.5 0(N) Model: Two Pion States 

We can perform the similar calculation for the two pion states. Let us take 
the isospin zero channel states (N — l) _1 / 2 T^ k T^_ k |0) <S> \lm). We have 

(l'm'\ ® (0|T,, k T i5 _ k LV^ k ^_ k |0) ® \lm)/(N - 1) 

= ll(l + N-2) + (N-l)^gl + 2fl + 2gl)u r 8 ll ,5 mm ,, (4.40) 

which implies that relative to the ground state the finite volume correction is 

A(2cu k ) = 2(/ k 2 + ^K. (4.41) 

This correction is also very small when we extract the two pion energy. For the 
simulation points where we extract the two pion energies, this correction is below 1 
percent and is therefore hidden in the statistical errors. 



91 



4.4 Symmetry Breaking of the Higher Derivative 
0(N) Model 

The similar analysis can be done with the higher derivative O(N) model. 
Having discussed the ordinary O(N) theory we will be very brief and only point 
out the differences. Many steps are also similar to the quantization of the higher 
derivative theory which was discussed in detail in Chapter (2). 

One starts with the general higher derivative Lagrangian which has a global 
O(N) symmetry 

c = - p 2 n 2 - p 3 n 3 )0 a + - (4.42) 

where □ = d\ — V 2 is the Minkowski space d'Alambert operator and the coefficients 
are parametrized as 

m\ m\ 1 1 m\ 1 

After the usual steps of indefinite metric quantization, and introduction of the Fourier 
modes, the Hamiltonian has the form ( see Equation (2.50) to Equation (2.53) for 
detail ) 

+ E <A + + -« + (Pi + 2p 2 k 2 + 3p 3 k 4 K k 7r 2 a * 

100 ^3 

+( Pl k 2 + P2 k 4 + p 3 k 6 )0? k 0? k + (p 2 k 4 + 3p 3 k 2 )^ k ^ k + + 



E^ + E^o(W) 2 (4-44) 
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Because we are now treating the system in a finite volume, we can no longer neglect 
the motion of the zeromode. Instead, following the idea of Born-Oppenheimer Ap- 
proximation ( or Adiabatic Approximation ), we will single out the direction of the 
0i° variable and make it the slow variable in our Born-Oppenheimer approximation. 
We can then decompose 

0« = vn a + /i(x)n a + 0? T (x), (4.45) 

and similarly for the 2 and 3 variables. The Hamiltonian is then written as sum 
of three types of terms 

H = H + Hk^ + H int , 



H = -^(^ 10 7T 20 + ^^30^30 + ^^20^20) + ^(y^S^S + *0203 + ~Y a )> 



^k#o = E <k<k + <^2k + + (Pi + 2p 2 k 2 + 3p 3 k 4 )7r 2 a k 7r 2 a * 

k>0 P3 

+ ( Pl k 2 + p 2 k 4 + p 3 k 6 + m 2 )0^ kL 0^ L + ( Pl k 2 + p 2 k 4 + p 3 k 6 )0? kT & 
+ (p 2 + 3p 3 k 2 )0^ k 0- + i0« k 0- + #-0^, 

#int = E 4A o^(^ + 0?t0?t) + Ao(^ + 0?t0?t) 2 - (4-46) 

X 

This Hamiltonian is identical to what we had in Chapter (2), except for the Ho piece 
( see Equation (2.55) ). For example, the k 7^ piece can be diagonalized in the same 
way as in Chapter (2). The interaction piece is also expressed as the creation and 
annihilation operators through the field variables. The H piece can be decomposed 
as follows in the finite volume. For convenience we use the rescaled variables given 
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by 



Pi = (PiV)- 1 / 2 ^, vl = ^A^ V% = (P3V)- 1/2 rc a 30 , 



q\ = (piV) 1 ^, q a 2 = \^ a 2 , q% = ( P3 Vf 2 r 3 , (4.47) 

V Pi 

and use the radial variables for 

<fi = \[frV{v + a)n a = pn a . (4.48) 
The derivatives for the qf are now substituted by 

<4 = H, £ + < 4 " -"•"■>*?■ (449) 

where the index a runs from 1 to N while the index a only runs from 1 to N — 1. 
The main difference lies in the derivative term with respect to the rotator variable 
n a . In Chapter (2), this was neglected because we were in the infinite volume. This 
term practically serves as the kinetic energy of the zero mode variable n a . One can 
establish the following identity 



IP1P2 = ip2LPi P - ip a 2T -K\ (4-50) 



where Lq is the generator of the variable qf only, i.e., 

* = -«W>- (451) 

With these transformations, H is further decomposed into three parts 
Ho = Hql + Hqt + Holt 



tt ■ 1 2 1 2 P2 2 • [Pi m 2 

H 0L = tp 2L Py + -P2L + oP 3 L + 7^%L + H ~ ?2LftL + T^V > 

z i zp 3 y p 3 zpi 
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1 1 I 

2 2 2/9 3 V P3 



Holt = {-i)p a 2T -Lf. (4.52) 
P 



The longitudinal part has the same form as the simple oscillator and can be diago- 
nalized easily. The transverse part can also be diagonalized as shown in Chapter (2) 

Hot = 4otMotVot (4.53) 

where the summation of a is from 1 to N and the energy gap is Ui t = M- g and 
uj 20T = J\4 g . In terms of these operators we can write out the explicit form of p\ T 



j4r = EV^(« -«°)> ( 4 -54) 



V ' 

where the polarization factor is given by e\ = t\ = — e -416 * 9 ) 1 / 2 . 

To summarize, in the finite volume we would have the following Hamiltonian 

H = H + H int + LI {r>2T—L a o b + — ^P^rj > 

U \ ^ (+)a (-)a 

-"0 - 2^ fl ikA a ikA ^ikA, 
i,k,A 

ffint = E 4 Ao^(/i 2 + 0iT0iT) + A o (/i 2 + 0iT0iT) 2 - (4-55) 

X 

In this expression, the first two terms are just the Hamiltonian of the model in 
the broken phase in the infinite volume. The third term is a purely finite volume 
correction which describes the coupling between the zeromode and the rest of degrees 
of freedom. The index A takes the value L and T respectively. All the operators can 
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be expressed in terms of the creation and annihilation operators as 



M*) = E 



ikL 



(4.56) 



P = v ^^ + ff ) = v^(^ + E^^(41 + 41)) ) 

where the form factors are given by the following table 



col 



M 2 M" 



\ (m 2 - M 2 ){ml - M 2 ) 



__ 2 j ClL — C 2 l 



M 2 M 



\ (M 2 - ml)(M 2 - M 2 ) ' 



cql = 1, cit = c 



2T 



(4.57) 



The creation and annihilation operators enjoy the following commutation realtions 



(4.58) 



In the higher derivative model we have the similar relation for the O(N) generators 
acting on the ground state 



L?\0)=iY,9*&c&\0). 



(4.59) 



With these relations we can now calculate the rotator contribution to the energy of 
the state. Due to the selection rule for the operator p^, the first order correction 
vanishes. The lowest order correction comes in at the second order in the perturba- 
tion Hamiltonian. Using the representations of the operators in terms of the creation 
and annihilation operators, it is easy to show that the first correction is simply the 



96 



rotator energy, 

E$ = [1(1 + N - 2) + A N ]u r . (4.60) 

Therefore, just like in the conventional O(N) model in the broken phase, the rotator 
energy spectrum is the most densely spaced excitation and dominates the invariant 
correlation functions. 
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Chapter 5 

Simulation Results and Discussions 

5.1 Simulation Algorithms 

Finding the right algorithms for the higher derivative O(N) model has been 
quite tricky [5, 6]. In the beginning of this project, we ran many tests on the existing 
algorithms for our model. First we tried some conventional update algorithms, for 
example: metropolis, heatbath and hybrid Monte Carlo. But these type of algo- 
rithms had several serious problems. One of these problems was that due to the 
next-next nearest neighbor coupling terms in our model, the neighbor gathering pro- 
cess becomes a rather time consuming task. In four dimensions, with the naive 
discretization, we would have had to collect the field variables at 128 neighbors for 
every lattice point. Compared with the ordinary theory, this is a factor of 16 more. 
Another problem of such algorithms was the critical slowing down when close to 
the criticality. This second problem was understandable because, in our model, the 
spectrum of the Fourier modes is greatly broadened by the higher derivative term. 
To understand more about this issue, let us look at the autocorrelation time in a 
standard hybrid Monte Carlo algorithm. 

Consider the higher derivative free field theory governed by the Euclidean 
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Lagrangian 

L £ = ^>j0(p)0(-p), (5.1) 
p 

where the spectrum uj 2 = p 2 + p 6 /M 4 + Big. The acceptance and autocorrelationts 
in this Gaussian type of hybrid Monte Carlo has been studied by A. D. Kennedy et. 
al. [1]. The autocorrelation time of the algorithm was found to be: 



2r 



(5.2) 



1 - yfl- (2u min T ) 2 

where r is the average length of each hybrid trajectory. The quantity 0J min is the 
lowest frequency of the Fourier modes, i.e. u m i n = min p c<jp. The minimum of the 
autocorrelation time is obtained when r = l/{2uj min ) with the value r = l/uj min . For 
the stability of the leapfrog integration scheme, the step size cannot exceed (l/u max ), 
where u max = max p u p is the highest frequency of the Fourier modes. Therefore, the 
computer time that the algorithm consumes to generate an independent configuration 
is given by 

rp ^max fc q\ 

J- comp ~ • 

Thus, the computer time needed to generate an independent configuration greatly de- 
pends on how broad the extent of the spectrum. In the conventional model, the high- 



est frequency is given by u max = V16 + m 2 . The lowest frequency is just m. With the 
higher derivative term added, the extension of this frequency is much broader than 



the former case. The highest frequency changes to uj max = yl6 + (16/M 2 ) 3 + m? 
while the lowest frequency remains unchanged. For the parameter range of M where 
we perform our simulation, this highest frequency is larger by a factor of 10 or more. 
Therefore, the autocorrelation time is enormous for the higher derivative theory in 
standard hybrid Monte Carlo due to the broadening effect of the frequency. 
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For the Gaussian model, this effect can be overcome by the so-called Fourier 
acceleration procedure [2, 3, 4], which is nothing but noticing that the ideal algorithm 
for the free Lagrangian above is to perform the simulation in Fourier space by adding 
the momentum dependent kinetic energy part 

H = E :^*(P)*(-P) + ^0(P)0(-P). (5.4) 

This p-dependent kinetic energy part will take into account exactly the frequency 
differences of the modes and, in fact, the p-dependence for the step size then drops 
out completely from the Hamilton equation of motion, as one can easily check. This 
hybrid algorithm is then equivalent to simulating V independent harmonic oscillators 
with frequency 1 in lattice units. However, nobody would be impressed if one can 
simulate a free theory effectively. When the interaction terms are added, doing the 
simulation completely in Fourier space is sometimes hopeless. This is particularly 
true if the interaction is of the 4 type, which is completely local in real space, 
but highly nonlocal in Fourier space. Therefore, the hope is that we use a Fast 
Fourier Transformation program to go back and forth between the real space and 
the Fourier space. When the quadratic parts are evaluated, we go to the Fourier 
space, and when the interaction part is needed, we go to the real space. Obviously, 
this depends greatly on how fast one can do the Fourier transform. It turns out the 
existing FFT package runs reasonably well on the cray with a speed of 300 — 500 
Mflop on the C90-machine. Another complication is that in the interacting theory 
we do not know what type of p-dependent kinetic energy term to add. The only clue 
is perturbation theory, however, one would expect that the low energy modes should 
be very well described by the renormalized parameters. It turns out that the main 
effect is the broadening effect due to M, and M does not get renormalized very much. 
Therefore, putting in the bare value for M basically overcomes most of the critical 
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slowing down. We are able to perform the simulation with an autocorrelation time 
which is below 10 hybrid Monte Carlo trajectories with each trajectory consisting of 
15 — 20 steps. Although this performance is not ideal, it works thousands of times 
better than the old programs, for which the autocorrelation time was hopelessly 
long. Also, in the Fourier accelerated Hybrid Monte Carlo, it is trivial to extend 
the algorithm to the improved actions. Since the quadratic part is evaluated in the 
Fourier space, it does not cost anything more for us to use the improved propagator 
as compared with the naive one. If this were implemented in the real space, it would 
require a lot more work. 

All of our results were obtained with the appropriate Fourier accelerated 
Hybrid Monte Carlo program. We currently have only the version for the finite 
bare coupling constant. Therefore, all results presented here are for some finite 
bare coupling constant. However, some of our simulation points have a rather large 
bare coupling constant in continuum notation, therefore, we expect that most of the 
physically interesting results will be quite similar in the nonlinear limit. 

5.2 The Extraction of Physical Parameters 

We will now extract some physical quantities from our simulation results [6]. 
One of the most interesting quantities is the vacuum expectation value v. This is 
the quantity which sets the energy scale of the simulation. In the old simulations, 
this parameter was obtained by measuring the bare expectation value of the aver- 
aged field variable. The wave function renormalization constant was then obtained 
from a linear fit to the momentum space propagator. From these quantities, the 
renormalized vev is then obtained using 

v R = Z-^vq. (5.5) 
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The crucial point is the measurement of the wave function renormalization constant. 
But in our case, things are more complicated. The momentum space propagator will 
not only contain the usual p 2 term, but will also contain the higher derivative terms. 
In general, the interaction will generate more terms which were not in the bare free 
propagator. This makes it more difficult for us to get a very accurate determination 
of the wave function renormalization constant. 

Another way of extracting the renormalized vev is from the rotator correla- 
tion functions. Using the theory discussed in Chapter (4) , we can write down an 
expression for the rotator correlation function n a (0)n a (r), where n a (r) is the unit 
vector of the zeromode at a given time slice r 

(n a (T)n a (0)} = AY, Kl + l) e -^('('+ 1 )- 1 /2) cosh [( 2 / + l)(r - (3/2)oj r \, (5.6) 
i 

where u r = (2L 3 v 2 )~ 1 is the rotator energy unit. This correlation function is dom- 
inated by the rotator energy spectrum in the finite volume. All the other energy 
excitations are much higher than the rotator energy scale. Usually the lowest one is 
the one Higgs contamination, whose energy scale is an order of magnitude higher. 
This correction can be easily taken into account according to the formula given in 
Chapter (4) . Since the rotator energy depends only on the renormalized vacuum 
expectation value (and the 3 volume), this is a direct way of extracting the vev. In 
our simulations, we have tried both methods and have obtained compatible results. 

In Figure (5.1), a typical rotator correlation function is shown compared 
with the fit to the theoretical form. The bare parameters are shown at the top of 
the figure. The lattice size for this run is 16 3 x 40. The output data has a total 
statistic of 32k hybrid Monte Carlo trajectories. At very short distances, higher 
energy excitations will contribute. Therefore, the fit was performed from r = 6 all 
the way to the end. The fit is very stable if the starting point is after r = 5. The fit 
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Figure 5.1: The rotator correlation function is shown together with the theoretical 
fit. The fit starts at r = 6 and the quality of the fit is good. The disagreement of 
the theoretical curve with the data for small values of r is because of the high energy 
contaminations. 

is also very stable with respect to the number of rotator states (nmax in the figure) 
that has been included. It turns out that any number which is greater than 3 would 
be adequate. In this fit, the correction of the single Higgs state is included using the 
formula described in Chapter (4). This correction is about 10 percent even at large 
t values. This is because of the small vev value of our simulation. The corrections 
due to the other states are all very small at large r values. It is clear that we have 
found a very good agreement with the theoretical formula. 

For comparison, the momentum space Higgs propagator is shown in Fig- 
ure (5.2). This momentum propagator was obtained from a run of the same input 
bare parameters as in Figure (5.1) except that it was on a cubic geometry of 16 4 
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Figure 5.2: The momentum space Higgs propagator is plotted as a function of the 
lattice momentum squared for the bare parameters shown at the top. The solid curve 
is a fit of the data to the polynomial form up to order p 6 . The upper window is a 
magnified portion of the lower window in the range p 2 < 4. The quality of the fit is 
reasonable, however, due to the ambiguity of the fitting functional form, the error in 
the fitted wave function renormalization constant Z is rather large. 

with the statistic of 20k trajectories. The form of the fitting function is taken to be 
f(p 2 ) = Z~ l p 2 + Z~ l m 2 +P2P 4 +P3P 6 - Note that the size of the coefficient of p 4 term 
is quite significant which is a signal of strong interaction effects. We should keep in 
mind that the above function has no justification if the interaction is strong. In gen- 
eral, the interaction could introduce complicated functional forms to the full Higgs 
propagator. It could generate logp 2 terms, higher polynomial terms and even terms 
that cannot be written as functions of p alone. Therefore, the size of the interaction 
terms like p 4 basically reflects the ambiguity of the fit. If we had tried the same fit 
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but setting the coefficients of p 4 to zero, we would have arrived at a rather different 
value of Z ( Z~ x = 1.33 ). From this we conclude that, due to the strong interaction, 
it would be very difficult to extract the wave function renormalization constant from 
the momentum space propagator. Other methods are needed for the extraction of 
the physical parameters and the momentum space propagator can only be used as 
an independent check. 

Another important quantity is the mass of the Higgs particle. In the old 
simulations, there were also two ways of obtaining the Higgs mass. One way is to 
use a fit to the momentum space propagator. The mass obtained this way has both 
advantages and disadvantages. The advantage is that the signal is very clean and 
we get a very stable fit for the mass even with low statistics. We can fit the very 
low momentum portion of the momentum space propagator where the effects of the 
interaction terms are small and the mass values are rather stable. The disadvantage 
is that the mass obtained from the propagator is not yet the physical Higgs mass. 
We must use perturbation theory to relate the two masses. This is legitimate in the 
old 0(4) calculation because, in that case, the theory is perturbative and the per- 
turbative formula offers us a rather accurate prediction. In a truly nonperturbative 
theory, however, this could be misleading. The mass obtained from the propagator 
fit, what we call the off-shell mass, could deviate significantly from the physical mass. 

Another way of determining the Higgs mass is from the time slice correlation 
function of the Higgs field. In this approach, the lowest energy gap of the Higgs 
excitation is extracted and identified as the Higgs mass in the finite volume. This, 
of course, should be closer to the physical mass than the off-shell mass and, in a 
strongly interacting theory this is the only way to get a good control of the Higgs 
mass. In our simulation of the higher derivative theory, the interaction is much 
stronger than the conventional 0(4) case, therefore, we used this method to extract 
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the Higgs mass. The off shell Higgs mass was also determined and only served as a 
comparison. 
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Figure 5.3: The time sliced Higgs propagator is plotted as a function of the Eu- 
clidean time separation r for the bare parameters shown at the top. The solid curve 
is a fit of the data to the single Higgs excitation. The fit was done in the range 
5 < t < 17 to ensure that the higher energy excitations have died out. The quality 
of the fit is reasonable, but the error for the mass parameter remains to be deter- 
mined. 

In Figure (5.3), we have shown the time sliced Higgs correlation function as 
a function of the Euclidean time separation r. The bare parameters are also shown 
at the top of the figure. At small distances, all higher energy excitations contribute, 
including the ghost states. Therefore, to ensure that we extract the lowest radial 
excitation, we started the fit from some r values so that the fit was stable from there 
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on. The functional form that we used is the standard hyperbolic cosine function for 
a single excitation. The data of the correlation function is derived from a blocking 
analysis of 32k hybrid Monte Carlo trajectories. If we compare this fitted mass value 
with the off shell mass, we find that the difference is very significant, which means 
the interaction is really much stronger when compared with the conventional 0(4) 
case. The data points of the correlation function are highly correlated. Therefore, 
we should develop a method to determine the error of the fitted mass value. 

To determine the error of the mass parameter, we performed the following 
blocking procedure. The output data is originally divided into small blocks. For 
this particular example, we had 80 blocks available. Due the large fluctuation, a 
single block is not enough to give stable mass values. Therefore, the small blocks are 
first grouped together to form N b larger blocks, large enough so that we can extract 
stable mass values from them. For each large block i, the following ratio is formed 

_ G,(r + 1) - G,(r) 

= G, W -G, (r -l)> (57) 
where % runs from 1 to the total number of large blocks N b . If we have only a single 
excitation that dominates the correlation function, then the correlation function 
should be of the form 

G theo (r) = A cosh[m(r - L t /2)\ + B. (5.8) 

Therefore, the ratio should only depend on the mass m and the Euclidean time 
separation r, 

theo( . = cosh[m(r + 1 - L t /2)] - cosh[m(r - L t /2)] 

{ ' cosh[m(r - L t /2)} - cosh[m(r - 1 - L t /2)} ' 1 ' ' 

Then the blocked values R{(t) are set to the theoretical value and we can solve for the 
mass numerically for each r. The outcome of this procedure is called the "effective 
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mass", denoted as m % e ^{r). Then, the averaged effective mass is obtained by 

1 Nb 

b i=l 

We can also obtain an error for the effective mass by 
Am eff (r) 



(5.10) 



(5.11) 



We can then plot the effective mass as a function of the time separation r, together 
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Figure 5.4: The effective mass plot for the time slice Higgs correlation function for 
the bare parameters listed at the top. The Higgs mass value is obtained from the 
X 2 fit to the plateau starting at r = 7. The dashed line tick marks denote the range 
of the fit. The horizontal solid line is the fitted mass value which is also labeled in 
the figure. The horizontal dashed lines denotes the error of the fitted mass value. 
The mass value from the effective mass plot is consistent with the value from the 
exponential fit. 
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with the appropriate errors. This is shown in Figure (5.4). We found that, since many 
states contribute for small values of r, the effective mass is varying with r. However, 
if we go to r values that are large enough, all the higher energy excitations die out 
exponentially and the lowest energy excitation dominates. Therefore starting from 
some t value, we should see a plateau behavior of the effective mass. The value of 
the plateau should basically be the energy of the lowest energy excitation. Since the 
signal is getting exponentially small with r, the error of the effective mass function 
will grow significantly with r. Usually near the endpoint (r = L t /2), the errors 
become so large that effective mass value is no longer meaningful. We can then 
perform a x 2 fit to the effective mass, giving higher weight to the more accurate 
points. From this fit, we can determine the mass and its error. 

But this is not the whole story yet. In fact even in the second approach, 
what we extract is not the infinite volume Higgs mass. The reason for this is very 
simple. All the simulations are done in a finite volume, and finite size effects must 
be taken into account. Among all the finite effects, there is one effect that is most 
disturbing. In the infinite volume, the Goldstone particles are exactly massless. 
Therefore, the Higgs particle can decay into two Goldstone particles, thus the Higgs 
has a finite lifetime. In the simulation, however, because the volume is finite, the 
lowest Goldstone pair is not at zero energy, but is equal to 4ir/L. This number is 
rather large for most of our simulations. In fact, it is larger than the Higgs mass 
itself. So the situation that we have in our simulation is that the Higgs is lighter than 
the Goldstone pair, and it therefore cannot decay. Of course, when the volume is 
increased, the Higgs mass energy level will meet the two Goldstone levels and the so- 
called level crossing phenomenon occurs. This was noticed quite some time ago. In 
fact, many groups have used this picture to get both the physical Higgs mass and its 
width from the measurement of the two Goldstone levels. In this picture, the Higgs 
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is viewed as a resonance of the Goldstone Goldstone scattering process. Liischer 
derived a formula which relates the Goldstone pair energy level in the finite volume 
to the infinite volume Goldstone- Goldstone scattering phase shift. By measuring the 
two Goldstone energy levels as accurately as possible for various volumes, one gets 
the continuum scattering phase shift profile in an energy range. If all the parameters 
are well chosen, one would be able to see a phase shift stepping from almost zero 
to almost 7T exactly at the threshold energy which is equal to the physical Higgs 
mass. One would also be able to get the physical width of the Higgs by fitting it to 
the Breit-Wigner shape near the resonance. So, instead of fighting against the finite 
volume effects, one could utilize it to gain precious information about the continuum 
theory. 

To carry out a similar calculation in our model is more difficult than the 
usual O(N) model. First of all, we must establish an equivalent formula in the higher 
derivative theory which can relate the energy levels in the finite volume to the phase 
shift in the infinite volume. Secondly, we have extra particles in our model, namely 
the ghost pairs. We have to control their contribution to the correlation functions 
in order to get reliable results for the two Goldstone energy levels. Thirdly, our 
model requires much more computing power to get good stable results for the time 
sliced correlation functions. The detailed analysis of this problem is given in the 
next chapter. 

The simulation results we have obtained belong to one of the following two 
categories. One is performed with the naive discretization action and the other 
category is performed by using the improved action. We have done simulations in 
both phases of the theory. The following table summarizes the bare parameter and 
extracted physical quantities of the points. In this table, points A through G are the 
results for the naive action while points H through J are for the improved action. 
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0.33(2) 


7.3(5) 


C 


0.105 
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0.0798(1) 
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0.24(1) 


2.9(1) 


E 


0.081 


1.0 


0.3 


16 3 * 40 
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0.0477(1) 
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0.0354(1) 


0.045(1) 
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2.0 


0.4 


16 3 * 16 


100k 






0.352(5) 


2.2(2) 



Point G and point J are in the symmetric phase, while all other points are in the 
broken phase. 

In the symmetric phase, the important physical quantity is the renormal- 
ized coupling constant, which could be defined to be the connected 4-point function 
at zero external momenta. In order to get this quantity, the propagator mass is 
measured. The renormalized coupling constant is directly measured by forming the 
connected 4-point function. The measurement of the renormalized coupling con- 
stant is very noisy, which requires large statistics of the data. We used the following 
formula to extract the connected four point function 

Xr ~ ^4" ( ivT^ { W) 2 J ' ( } 

where N is the number of components of the field, uir is the propagator mass and 
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is the 4-volume of the system. The quantity 2 is defined to be J2a=i 4> a 4> a where 
(p a is the 4-volume average of the field <p a (x). The quantity 4 is just a short hand 
notation for (0 2 ) 2 , and the bracket means the Monte Carlo ensemble average. It is 
the subtraction in the bracket which causes most of the noise. Therefore, in order 
to get sensible results we have accumulated large statistics for the two points in the 
symmetric phase (Point G and J in the table). In Figure (5.5), we have shown the 

m R = 0.352 



- A R = 2. 28061 ±0.307542 
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Figure 5.5: The renormalized coupling constant (connected four point function at 
zero external momenta) is plotted for individual runs. Due to the subtraction the 
signal is quite noisy and a large statistic is needed to get a sensible accuracy for this 
quantity. 

renormalized coupling constant for individual runs for the higher derivative 0(4) 
model. It can be seen that the result is quite noisy and the points scatter a lot 
around the average. Usually 100 K is needed for an error of about 10 percent. 
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We can now compare the renormalized coupling constant that we measured 
for the higher derivative 0(4) model with that of the conventional 0(4) model [7]. 
In the symmetric phase (point G and J), we found that the renormalized coupling 
constants were much larger than in the old lattice simulation results of 0(4) model. 
In the conventional 0(4) model, when the correlation length was about 2 — 3 the 
renormalized coupling constant \r was typically of the order of 0.6 — 0.8. In our 
model, however, we saw a huge jump (about a factor of 2 to 3) of the renormalized 
coupling constant. This is a signal that the higher derivative model is much more 
strongly coupled than the conventional 0(4) model. Recall that, from the large 
N calculations in Chapter (3), large N also predicts a jump in the renormalized 
coupling constant in the symmetric phase. Therefore, our simulation results agree 
with the large N results qualitatively. 

In the broken phase, the renormalized vacuum expectation values are ob- 
tained using the rotator correlation functions as described above. The errors are 
estimated from a blocking analysis of the data. 

The Higgs mass is taken to be lowest radial energy excitation in the finite 
volume. As described above, we tried two ways of extracting this energy gap. One 
by fitting the time sliced correlation function to the hyperbolic cosine function, the 
other from the effective mass plot. Both methods gave compatible results and the 
errors are determined from the \ 2 fit of the effective mass plateau in the appropriate 
range. 

Identifying this energy gap with the infinite volume Higgs mass is of course a 
rather crude approximation and is subject to finite volume corrections. However, as 
shown in the table, we did not see a significant change in the ran/vR ratio when the 
lattice volume was increased. In fact, they are compatible with each other within 
errors. We also tested this within the framework of the large N approximation. 
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We found that the ratio E(L)/v(L) was rather stable when the size of the box 
was changed, as long as the box size was not too small and the energy crossing 
phenomenon had not occurred. And the value of the ratio was in agreement with 
the infinite volume large N value. Therefore, we expect that this ratio represents 
the feature of the continuum higher derivative theory. The correct way of extracting 
the Higgs mass has to come from the finite volume resonance picture, which we will 
discuss in the next chapter. 

Another issue in the Higgs mass bound problem is to determine how much 
scaling violations (cutoff effects) are present in our results. This turns out to be 
a rather subtle issue. To study this problem, we have to answer the following two 
questions: (1) what is the nature of the scaling violations in our model and, (2) how 
can we calculate the scaling violations once the Higgs mass and the ghost parameters 
are known. 

First we will review how the above two questions are answered in the con- 
ventional O(N) model simulations. In the conventional O(N) model, the scaling 
violation is due to the hypercubic lattice that violates Euclidean (or rotational) 
invariance. This scaling violation can be defined both perturbatively and nonpertur- 
batively. To calculate this scaling violation, we can check the rotational invariance 
of some quantity, for example, the free propagator of the field [12], or evaluate the 
Goldstone scattering amplitude and compare with perturbation theory [8, 9, 10, 11]. 
The second method seems to be more closely related to measurable quantities, but it 
relies on the perturbative nature of the problem. It worked out nicely for the conven- 
tional 0(4) simply because even at the highest bound, the theory is still perturbative. 
The first method offers us an unambiguous result without using perturbation theory. 

Now, let us look at the situation for the higher derivative lattice theory. 
People tend to think that in the higher derivative 0(4) theory there exist two types 
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of scaling violations. One is the effect due to the lattice; the other one is what is 
usually called the Pauli-Villar cutoff (or ghost) effects. However, such a statement 
is very misleading. In fact, as we have shown in the previous chapters, this should 
not be the view, at least not the only view, of the higher derivative theory. This 
theory is a well defined field theory which has a unitary S'-matrix and the ghost 
effects can easily evade the experimental tests. It is also a well-defined theory free 
of divergences. Therefore, if we could do the simulation in the continuum, we would 
have had no cutoff effects at all. It is only because the computer cannot handle 
infinite number of variables that we have to introduce the underline lattice to the 
theory. As long as we can constrain our lattice effects to be small, our simulation 
results should represent the higher derivative 0(4) model in the continuum. In other 
words, there are no "ghost effects" if the ghosts are well hidden from any experiment. 

As stated previously, in analyzing the lattice effects, perturbation theory 
should only be taken as a hint. There have been ways of doing nonperturbative 
analysis of the lattice effects, though none of them is really sophisticated. One of 
the things that could be done is to analyze the breaking of the Euclidean invariance 
of the free propagator at some given parameters. This was first discussed by Lang 
et. al. in 1988 [12]. Although it only uses the tree level propagator, it is still a 
very good measurement of the amount of lattice violations in the theory. Obviously, 
going beyond this using perturbation theory is hopeless if the theory is strongly 
interacting. One can try to carry out the same analysis for the propagator in the 
large N approximation, but again, the justification for the large N approximation 
at N = 4 is not very promising either. 

Let us now review some of the basic ideas of how this procedure is car- 
ried out for the propagator. On the lattice, the propagator in momentum space 
is, in general, a function of every individual momentum component. In the contin- 
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uum, however, it should only depend on the combination p 2 = Y^i=iV^V^ due to 
Euclidean invariance. This symmetry is violated on the lattice and we can define 
a quantity Mg which represents the amount of violation due to the lattice. For 
the inverse momentum space propagator, the quantity Mg is defined in the follow- 
ing way. Let us pick some prescribed momentum scale p cut in lattice units, and 
pick our reference momentum to be p = (p cut , 0,0,0). Then we can form all the 
momenta that have the same magnitude as this reference momentum in the form 
TZpo = Pcut(cos 9i, sin 6>x cos 9 2 , cos 6\ sin 9 2 cos 9 3 , cos 6\ sin 6 2 sin 6> 3 ). We can then de- 
fine the rotational invariance violation by Mg by 



where d7Z is the invariant measure for the rotational group normalize in such a way 
that / dlZ = 1. Obviously this quantity is identically zero in the continuum where 
the rotational invariance is restored. On the lattice, the size of this quantity is a 
measure of the lattice effects in the discretized theory. In principle we can define 
similar quantities for other functions. 

We have performed the rotation invariance analysis for our simulation points 
using both the tree level and large N approximation. In Figure (5.6), we have 
shown some of the rotational invariance violations for the tree level propagator of 
our simulation points We found that all our simulation points have very small lattice 
effects. For example, even with the naive propagator, in the Higgs mass range where 
we did our simulation, the rotational invariance violation is not larger than the old 
0(4) simulation points with correlation length of 2 — 3. 

We can also calculate the finite volume lattice violation in both the free 
propagator and in the large N approximation. In a finite box with lattice structure 
the lattice momenta are discrete and can only be multiples of 2n/L. For each integer 




(5.13) 
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Figure 5.6: The rotational invariance violation for the free inverse propagator on 
an infinite hypercubic lattice is plotted for various cases. The bottom two curves 
are the naive propagator and the one using the improvement up to 14th order. The 
upper two boxes show the Pauli-Villars case for M = 2 (the solid lines) and M = 0.8 
(the dashed lines) when using the naive and improved action. It is clearly seen that 
for the parameters that our simulation are performed , the rotational invariance 
violation is very small. 

n cut there will be more than one set of solution (n(\ n^, r$) to the equation 
n cu t = n\ + n\ + n§ + n\. Denoting the total number of solutions by D{n cut ), we can 
then define the counterpart of Mg in the finite lattice 



G(n cut ) 



2^ G ((-r) n i , {-rm, (-pK , (vK )' 



D(n cut ) 



(5.14) 



i=i 



Main-cut) 



1 Di ^ t} (G(Cf)nf\ Cf)n%\ ( 2 f)4\ (fjrg) - G{n cut )Y 
\D{n cut ) ^ 



G{n cut ) 2 

Due the finite size effects, the momentum lattice is coarse grained. This will result 
in some zigzag behavior of the function A/g(?w), as n cut is increasing. However, for 
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a reasonably large lattice, we will recover the infinite lattice results. In Figure (5.7), 
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Figure 5.7: The rotational invariance violation for the large N propagator is plotted 
different lattice sizes. The bare parameters are chosen to be close to the ones in our 
simulation. For small lattices, because the momentum is discrete, the function is 
not smooth. But for the larger lattices the function approaches the infinite volume 
result. 



this rotational violation is shown for one of our simulation points for the large N 
propagator. All the rotational invariance violation are well under one percent level. 
We are therefore confident that our results should represent the features of the higher 
derivative theory in the continuum. 
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Chapter 6 

Extracting Scattering Phase Shift 
Using Finite Size Techniques 

6.1 Resonance in Finite Volume 

In the previous chapter we argued that extracting the mass parameter of an 
unstable particle in a finite volume is not a trivial task. For the volume that people 
usually perform their Monte Carlo simulations, the lowest two Goldstone particle 
state has an energy eigenvalue which is higher than the Higgs mass parameter. This 
means that in such volumes the Higgs cannot decay into the Goldstone pair as it 
should in the infinite volume, even if the interaction between the Higgs and two 
Goldstone state is turned on. This problem can be solved in two ways if the theory 
is only weakly interacting. In the first conventional way, one tries to extract the 
propagator mass in the finite volume, then the finite volume corrections are added 
to get the propagator mass in the continuum infinite volume. After that the pertur- 
bation theory is used again to relate the propagator mass to the on-shell physical 
mass of the Higgs particle. The width of the Higgs can also be calculated using 
perturbation theory. This method heavily utilizes perturbation theory. The second 
method is to extract the infinite volume continuum results directly by measuring 
some quantity in the finite volume. With this method, one needs a general formula 
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which will relate the infinite volume quantities to the finite volume quantities with- 
out using perturbation theory. In the conventional 0(4) simulations both methods 
have been tried and they give compatible results. It is obvious that for our higher 
derivative O(N) model, due to its strong interaction, only the second one can be 
used to analyze the finite size effects. In fact, the basic idea of the second approach 
is to make use of the finite size effects instead of fighting them. Let us now review 
some of the basic ideas of this approach. 

We start with the conventional 0(N) model without the higher derivatives. 
The basic particle excitations in the broken phase in a finite box consist of Higgs ex- 
citations and Goldstone excitations. Consider the eigenstate of one Higgs excitation 

and the eigenstates of two Goldstone excitations. Due to the Euclidean invariance, 
1.5 r-rr 
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Figure 6.1: The zeroth order of the level crossing is shown schematically as a 
function of box size L. The Higgs excitation is L independent while the two Goldstone 
excitations is decreasing with L. At some value of L, the Higgs energy level will cross 
the Goldstone energy levels and if the interaction between the two is turned on, the 
two levels will repel each other and split. 
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we can take the rest frame of the Higgs particle. There is a major difference between 
the energy of the Higgs excitation and the Goldstone excitations. The Goldstone 
pair with opposite three momenta has an energy eigenvalue which is dependent of 
the box size. The lowest one is An / L where L is cubic box size. The Higgs excita- 
tion, however, will not depend on the box size and is just a constant. To the lowest 
order, these particles behave just like free particles. In Figure (6.1), the dependence 
of these eigenvalues are shown. After the interaction is turned on, the one Higgs 
excitation mixes with the Goldstone pair excitation. For small box sizes, when the 
Higgs mass is below the lowest Goldstone pair excitation energy for that box size, 
ordinary perturbation theory will give us the correction of these energy levels if the 
interaction is not too strong. For some larger box size, however, the Higgs excitation 
will cross the Goldstone pair excitation and for that particular box size, degenerate 
perturbation theory should be used to calculate the level crossing. If the interaction 
is weak, one would expect a plateau in a range of L which should be identified as the 
physical Higgs mass and the splitting at the crossing point basically gives you the 
width of the Higgs particle. In order to use this picture of resonance in a finite box, 
a nonperturbative relation must be established from which one can get the relation 
between infinite volume quantities and the finite volume quantities. Finally, if the 
finite volume quantities are measured in the Monte Carlo simulations, we can use 
this relation to deduce the infinite volume results nonperturbatively. 

6.2 Liischer's Formula 

In the infinite volume, the Higgs particle is identified as a resonance in the 
isospin channel in Goldstone-Goldstone scattering. As in any two particle scatter- 
ing process, the scattering cross section is characterized by the scattering phase shift 
6(E) at a given center of mass energy. When the center of mass energy is at the 



124 



physical Higgs mass, we see a peak in the scattering cross section and the scattering 
phase shift rises dramatically from almost zero to almost ir. In this case, there is 
a resonance in the scattering process and a Higgs particle is produced. The mass 
of the Higgs particle is identified by the position of the peak or equivalently by the 
energy at which 5(E) crosses ir/2. The width of the particle is given by the range 
in which 5(E) steps from almost to almost ir. For an ideal resonance, that is the 
resonance which is infinitely narrow, the scattering phase shift will step up exactly 
7r. But for wide resonances the sharpness and the height of the step is greatly re- 
duced. Therefore, the scattering phase shift in the infinite volume fully describes the 
properties of the Higgs particle. 

People have derived a relation which relates the infinite volume phase shift 
to the two Goldstone particle energy eigenvalues in the finite volume. This relation, 
with the name Luscher's formula, was derived first by DeWitt in a different form 
[5]. Later Liischer rederived it and expreseed in a form suitable for nonperturbative 
Monte Carlo simulations [1, 2]. It was used by Zimmermann et. al. to study the 
conventional 0(4) model and proved to work very well [3, 4]. We now derive this 
formula with a method that is based on DeWitt, since this can be easily generalized 
to the higher derivative case. 

Consider a quantum mechanical system governed by the Hamiltonian H = 
H + V in a three dimensional box whose side is L. We can define the resolvent 
operators G(z) and G°(z) as 

G(z) = (z-H)- 1 G°(z) = (z-H )-\ 



G(z) = G°(z) + G°(z)VG(z), (6.1) 
where z is just an arbitrary complex number. Consider the matrix elements of 
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G(z) between two states |a) and \(3). where \a) and \(3) are eigenstates of the free 
Hamiltonian H , i.e. H \a) = E a \a), Hq\/3) = Ep\f3). We have 

(a\G(z)\(3) = -^-r(5 a( s + (a\VG(z)\(3)). (6.2) 
z - hj a 

Let us now define the self energy operator such that 

Then the matrix element of G(z) may be written as 

(a\G{z)\(3) = -^rr(<W + (a\Z{z)\())<J)\G(z)\P)). (6.4) 
z- b a 

The self energy operator defined above satisfies the following integral equations 

(a\Z(z)\(3) = (a\V\(3) + -£( a \V\ 1 )-^—( 1 \E(z)\/3). (6.5) 

Setting a = j3 in the above equation we get 

(a\G(z)\P) = (z-E a - (a\i:(z)\a)y l . (6.6) 

Note that the pole of G(z) in the complex z plain should be the exact eigenvalue of 
state | a), and we get 

e a - E a = (a\E(e a )\a), (6.7) 

where e a is the eigenvalue of the full Hamiltonian for the state that is perturbed from 
| a). This formula tells us that the expectation value of the self energy operator in 
some state gives us the so-called energy shift which is the energy difference between 
the exact eigenvalues and the free eigenvalues. 

Let us now look at this integral equation in a very large box, where the 
intermediate states are very dense and we would expect to be able to go to the 
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continuum limit. In fact, we can write down an expression 



+ E = V-^rr + S(z - H )*(z), (6.8) 

- z — £/ 7 Z — Hq 

where the function <&(z) is given by the energy shell sum 

This relation is obtained in the following way. Imagine that z is some real number 
and we take some small real positive number e and divide the real axis into a small 
interval [z — e, z + e) and the rest. For a very large box, the eigenvalues E 1 will be 
very dense and they are treated separately, depending on whether they fall in the 
interval or not. For those states whose eigenvalues fall outside the small interval, 
the sum will better approximate the principal valued expression if we take smaller 
e values. For any fixed e there will be infinite eigenvalues which fall into the small 
interval, as long as the box size is going to infinity. For these states, if the operator 
is inserted in some smooth function of the energy, they are equivalent to the delta 
function which selects out the specific energy. The function $(z) is basically the 
degeneracy sum of all the states that has almost the same energy in the infinite 
volume. 

With this relation we can rewrite Equation (6.4) in the following way 

(a\G(z)\P) = (a\V\/3) + £< a |vJ^E(s)|/?> - (a\V\(3) , (6 . 10 ) 

7 z — £/ 7 Z — hi p 

If we now take z = tfs and make use of Equation (6.7) we get 

(a\X(z)\f3) = (o|(l - VV^—-Y 1 Vb{z - H )X(z)\(3)<S>(z). (6.11) 
We can then multiply both sides by a factor of 27rS(E a —Ep). Note that the scattering 
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phase shift operator is given by 

(a\ - 2tan5|/3) = 2n5(E a - E p )(a\(l - VV - 1 - Y^\P), (6.12) 

also we can take our states \a) to be diagonal in angular momentum, we then get 

$A a (e a ) = -ncot5 Xa (e a ). (6.13) 

This is the basic formula which relates the scattering phase shift in the infinite 
volume to the exact energy eigenvalues in the finite volume. To be specific with 
the function $, note that in the isospin channel of two Goldstone particles with 
opposite momenta k, we have the relation 

^P^*^^ (6 - 14) 
where E\ is the energy of one Goldstone particle. We get dE = A7r 2 /(L 3 kEi) ,so 

2tt 2 1 



*<>(*) = E 



= E 



L 3 kE^z/2-E^k)) 

4tt 2 1 

L 3 k(k 2 - (f) 2n2 ) 



= _-l-Zoo(l,g 2 ), (6.15) 

where we have used the dispersion relation for the Goldstone z/2 = k and q = 
kL/ (2n). The zeta function is defined to be 

^W)=E^l (6-16) 
V n 2 - q 2 

where the function 3^ m (n) is the usual spherical harmonics. When this expression is 
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substituted into the general formula, we get 



cot S (E) 



1 



^oo(i,g 2 ), 



E 2 /A 




m n = 0, 



kL 



(6.17) 



Q 



This is exactly Luscher's formula that has been used by Zimmermann et. al. in their 
simulation except that they were working with the nonzero mass case for the Gold- 
stone particle. It is clear from the above derivation that the condition of the massive 
Goldstone is not necessary. In fact, as we will see below, we have tested the massless 
case in the conventional 0(4) case and got consistent results with Zimmermann et. 



This problem can be understood in the following way. Recall that in Luscher's 
derivation of the formula, he assumes that the pion (Goldstone) has a finite mass 
due to the non-vanishing external source. This external source tilts the potential 
and makes the potential lower in the direction of the external source. Therefore in 
the potential valley it is not flat but rather has a slope. This is why the Goldstone 
particles become massive. Then the Higgs field is defined to be the 4- volume average 
of the field variable along that particular direction. The Goldstone field is defined 
to be the field along the directions that are orthogonal to the Higgs field. However, 
when the external source is gets smaller, the tilting in the potential becomes weaker. 
As a result, the fluctuation around the Higgs direction becomes stronger. In the 
limit of a vanishing external source, the special direction is not defined at all and 
the potential becomes totally O(N) invariant. It is clear that in this limit, the Higgs 
field and the pion field is not well-defined. This is also seen in the finite volume 



al.. 



129 



correction of the pion mass. Therefore, if the box is too small, we will not be able to 
disentangle the energy correction to the one pion energy and the interaction between 
the two. In this case, the correction to the single pion mass depends on the quantity 
m v L exponentially. When the pion mass goes to zero, the finite size correction to 
the single pion energy will be very large. This is the main reason that one has to 
take the nonzero pion mass. 

In fact, the situation in the O(N) model is more subtle. First of all, the 
massless pion dispersion relation is protected by the symmetry in the very large 
volume limit. Therefore, the energy of a single pion would be exactly multiple of 
27r/L, even if the interaction is turned on. Exactly at the vanishing external source, 
we know that the above picture is not a good picture of the symmetry breaking 
mechanism in the finite volume. Instead, we should use the Born-Oppenheimer 
picture discussed in Chapter (4) . In the Born-Oppenheimer picture, the pions are 
massless, and the zero momentum pion is replaced by the rotator excitations. The 
Higgs field can also be meaningfully defined. As we have seen in Chapter (4) , there 
will be no large finite volume corrections to the two pion energy hence the energy 
shift in the finite box totally reflects the interaction between the two pions. Note 
that there is no contradiction to the finite external source case. If the external 
source is present and significantly different from 0, then the conventional picture of 
the massive pion works very well and the Born-Oppenheimer picture would be a very 
bad approximation since the potential is so tilted. On the other hand, the Born- 
Oppenheimer picture is valid for very small external source where the conventional 
picture breaks down. Therefore, our conclusion is that Liisher's formula will still 
work even in the massless pion case, as long as the field definitions are adjusted 
according to the Born-Oppenheimer picture described in detail in Chapter (4). 
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6.3 Integral Representation of the Zeta Function 

In the Liischer's formula, the zeta function needs to be dealt with carefully. 
For the cubic geometry, it turns out that a useful integral representation of the 
function exists which we will now discuss [1]. 

In general, the zeta function is defined to be 

Zi m {s,q 2 ) = E^ m (n)(n 2 -g 2 )- s , (6.18) 
nez 

where the symbol ^m(n) stands for the usual spherical harmonic functions and the 
summation is over all the three dimensional integers. In order to derive the integral 
representation, let us also define the heat kernel by 



V e^ (x " 2?rn)2 (6 19) 



We will also need the truncated heat kernel 



KL(t,*) = tA^ E 3Wn)e«-*- ta2 . (6.20) 
W |n|>A 

Then the zeta function has the following representation 

Z lm ( Sl q 2 ) = ^ m (n)(n 2 -g 2 )- s 

|n|<A 

+ TTT r 'dtt-^/C^O), (6.21) 
1 {s) Jo 

as long as s satisfies the condition Re(s) > 1/2 + 3/2. Note that the combination 
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exp(tg 2 )/C / A m (t, 0) has the following asymptotic behavior 

e tq ^L(t,0) ~ S J^ t -^ + 0(t-V% f-0. (6.22) 

Therefore we immediately have the following analytic continuation for the zeta func- 
tion in the range Re(s) > 1/2, 

"" <S - q> = ^J""^ " ] + Wi w^-3/2) 

+ J" dU-'e"'K} m (t,0)) ■ (6.23) 
In particular for s = 1 we have 

z lm ^q 2 ) = E ^WnXn 2 -? 2 )- 1 

|n|<A 

+ ^) 3 /f*(^KL«.0)-(^i). (6.24) 

The above integral representation is suitable for numerical evaluation of the zeta 
function. The integrand is evaluated for any t value and the integrals are performed 
numerically using the standard integration subroutines (e.g. IMSL). When evaluat- 
ing the integrand one has to distinguish the case for t > 1 and t < 1. The first line 
in the representation (6.19) is used for the case t > 1 while the second line is used 
for the case s < 1 for better convergence. It turns out that an accurate numerical 
answer can be obtained for q 2 values not larger than 10, which is the case in the 
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practical applications. 

6.4 Simulation Results on the Conventional 0(4) 
Model 

As mentioned in the previous section, we performed a test simulation first 
on the conventional 0(4) model without the external source term. Therefore, our 
pion dispersion is the massless dispersion. The simulation was done using a cluster 
update program which runs very efficiently on the alpha AXP workstation. In the 
simulation, we made measurements after every 10 cluster updates and for each lattice 
size a 100,000 to 200,000 measurements were accumulated. The operators that we 
took into account were the radial Higgs field, and the four lowest pion pair states. 
We chose our simulation point so that the Higgs mass would come out around 0.6. 
We were also working in the nonlinear limit of the 0(4) model and the input bare 
parameter was the hopping parameter k which we fixed to be 0.315. Old simulation 
results indicate that the Higgs mass for this point should be m H = 0.581. We scanned 
the size of our 3- volume from 8 to 24 with a step of 2. The correlation functions 
were then analyzed to extract the energy levels in this isospin zero channel. The 
correlation matrix was diagonalized and the eigenvalues were used to extract the 
energy levels. The errors of the energy levels were obtained by blocking the data. 
These errors in the energy levels then translate into the errors in the phase shift when 
using Luscher's formula. The final results can be summarized in Figure (6.2), where 
we have plotted the the scattering phase shift as a function of the center of mass 
energy obtained from the application of Luscher's formula (the data points). The 
solid line is a perturbative fit to the data which yields a mass and width compatible 
with the expected results. The dashed line represent the perturbative results when 
the old values of m H are substituted in. The highest point is for the lattice size 
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Figure 6.2: The scattering phase shift is extracted using Liischer's formula in the 
isospin channel with zero Goldstone mass. The solid line is the fit to the relativistic 
Breit-Wigner shape and the fitted values of Higgs mass and width are also shown. 
This is in good agreement with the perturbative prediction (dashed line). 

8 3 x 32 and this point overshoots the expected values. This could be because of 
the lattice effects and the subleading finite volume effects. The rest of the points 
agree nicely with the perturbative results. The large error bars for the larger lattices 
is purely due to the lack of the statistics. This plot is a clear indication that the 
formula also works in the massless long as we define our Higgs field properly. 
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6.5 Liischer's Fomula for Higher Derivative The- 
ory 

The relation in the higher derivative theory is quite similar to that in the 
conventional theory. We just have to repeat most parts of the previous derivation 
and make adjustments accordingly. In the case of the higher derivative theory, the 
Hamiltonian is still self adjoint, i.e. 

r]H^r] = H , riHlri = H . (6.25) 

We will still define the resolvent operators as 

G(z) = (z-H)- 1 , G (z) = (z - Ho)- 1 , 

G{z) = G (z)+G (z)H 1 G(z). (6.26) 
We will set up the basis \a) such that 

H \a) = E a \a), ^ \a)(a\r) = 1, (6.27) 

a 

where the eigenvalue E a could be complex. The state are chosen to satisfy (J3\r]\a) = 
5 a p. We can now define the self energy operator according to 

and it satisfies the following integral equation 

(amz)\(3) = (alrjH.lP) + ]>> - J B 7 )- 1 (a|7 7 // 1 | 7 )(7|^)|/3). (6.29) 
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Similarly the energy shift is given by the diagonal matrix element of the self energy 
operator. 

AE a = {a\r,Y,{E a + AE a \a). (6.30) 

In the large 3- volume limit, the intermediate state summation in the integral equation 
of the self energy reduce to the following 

^ z — E ^ z — E ^ z — E 



„ \ ^ Z — -E-y 2 — -ffn 

7,|£ 7 -2:|<e 7 U 

= ^-// )^)+^^— . (6.31) 

z — Hq 

Taking z = Ep + AEg we again arrive at 

{<x\rjZ{z)\P) = (a\ V (l - H.V^—)- 1 HJ{z - H )X(z)\P)$(z). (6.32) 

z — n 

The delta function in the above equation restricts the intermediate states summation 
to take only the real energy eigenstates. The matrix element of the operator (1 — 
HiP^h^I^Hi between the physical states is nothing but the phase shift as can be 
verified from the general formula established in Chapter (3). Therefore, we would 
conclude that the Liischer's formula will still work, even in the case of the higher 
derivative theory with ghost states. 
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6.6 Phase Shift for Higher Derivative Theory in 
1/N Expansion 

As we have established the validity for the Liischer's formula for the higher 
derivative, it is very instructive to show that this indeed will work out in the large N 
expansion. Recall that the continuum scattering phase shift for the higher derivative 
theory has been calculated in Chapter (3) in the large N expansion. There the 
unitarity is maintained in the large N expansion and the phase shift could have 
v=0.18 M = 5 A = 720 = O.25tt 
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Figure 6.3: The result of the scattering phase shift for the higher derivative theory 
in the large N limit is shown as a function of the center of mass energy. The data 
points are obtained from the two Goldstone particle energy eigenvalues in the finite 
cubic box in the large N limit by applying Liischer's formula. The solid line is 
the continuum large N calculation for the same set of parameters as described in 
Chapter (3). The corresponding cross section is also shown. The agreement of the 
two methods is clearly seen. 
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some microscopic acausal effects. To verify that Liischer's formula in the higher 
derivative theory, we have evaluated the energy eigenvalues for the two Goldstone 
particle in the isospin channel in the large N limit. This is done by solving for the 
real roots of the matrix element (h\(z 2 — H 2 )~ l \h). In the leading order of the 1/N 
expansion this matrix element reduces to the geometric summation of the Goldstone 
bubbles. Then these eigenvalues are substituted into Liischer's formula to extract the 
phase shift, which is then plotted against the center of mass energy of the scattering. 
In Figure (6.3), the result of this calculation is shown. The points are the phase 
shift values obtained from Liischer's formula. The solid line is the corresponding 
continuum calculation of the phase shift as described in Chapter (3). It is evident 
that the formula is working very well. We have tried the same procedure for other 
set of parameters and they all give good agreement with the continuum calculations. 

6.7 Phase Shift Simulations for Higher Derivative 
Theory 

The result from the large N expansion makes us confident that the same 
procedure could be carried out for the higher derivative field theory just as it was 
done for the conventional O(N) model. Owing to the improved action, we can 
now select our Higgs mass value to be around 0.7 and the ghost mass parameter at 
M = 2.0 and still keep the lattice effects small. This makes it possible to perform 
such simulations on the higher derivative O(N) model. However, there are quite a 
number of technical difficulties. 

One of the main difficulties is that some efficient algorithms that are available 
for the conventional theory break down miserably for the higher derivative theory. 
For example, the over relaxation algorithm is very slow for the higher derivative 
theory due to the neighbor gathering. The cluster algorithm is simply not working 
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at all ( the whole lattice tends to become one huge cluster). In fact the only usable 
algorithm is the Fourier accelerated hybrid Monte Carlo algorithm, which only works 
for finite bare coupling. Also, it is not as efficient as the algorithms mentioned above 
for the conventional theory. This means that to really get the stable energy levels, 
we would have to run a rather long time. 

Another difficulty is the understanding of the shape of the phase shift as a 
function of the center of mass energy. If the theory is strongly interacting, we can 
no longer hope to fit the simulation data to the perturbative results. A scheme to 
extract the physical parameters like the Higgs mass and its width is needed. If the 
Higgs resonance is well separated from the ghost, we could try the Breit-Wigner 
shape near the resonance, neglecting the effects of the ghosts. 

The simulation of this project is still in progress and we hope to release the 
results in the near future. 
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Chapter 7 
Conclusions 



Our project was first motivated by the study of the Higgs mass bound prob- 
lem in a Pauli-Villars regulated theory [3, 4, 5]. This theory can be viewed as a 
limiting case of the higher derivative O(N) scalar field theory. The study of the 
higher derivative theory goes beyond the scope of the Higgs mass bound problem. 

The higher derivative O(N) model that we have studied is obtained from the 
conventional O(N) scalar field theory by adding higher derivative terms to the Higgs 
kinetic energy [1, 3, 4]. We have established the consistent quantization procedure 
of the higher derivative scalar field theory, and have shown this theory to be finite 
and unitary with possible violations of microscopic causality [2]. Therefore, the 
ghost states in the theory can easily evade experimental tests. We have also studied 
the model nonperturbatively in computer simulations by introducing an underlying 
lattice structure. 

In the continuum, the higher derivative O(N) model can be viewed as the 
Pauli-Villars regulated conventional O(N) model in the small m H /M limit, where 
m H is the Higgs mass and the M is the Pauli-Villars mass parameter. It can also 
be viewed as a finite, well-defined and unitary theory with ghost excitations. The 
continuum large N study of our model shows that this theory can incorporate a 
heavy Higgs particle in the TeV range, with the ghost pair well hidden at a few 
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times heavier than the Higgs particle [3]. 

On the lattice, our model can represent different universality classes of mod- 
els, depending on how the criticality is approached. It could represent the conven- 
tional trivial O(N) model at criticality, in which case, the higher derivative terms 
indeed become irrelevant. However, in another limit, it could also represent the 
higher derivative O(N) theory in the continuum, in which case, the theory is not 
trivial and the higher derivative terms cannot be viewed as irrelevant operators in 
the Lagrangian. 

From our simulation results of the model, it is evident that any attempt to 
perform a systematic search of higher dimensional operators to determine the Higgs 
mass bound would not make any sense [5], since, as far as the Higgs mass bound is 
concerned, one cannot tell whether a higher dimensional operator is relevant for the 
problem or not. 

In our nonperturbative simulation of the model, we find: 

(1) Our model can generate a much heavier Higgs particle than the con- 
ventional O(N) model, which is in agreement with the large N result qualitatively. 
Without introducing the more complicated structures like technicolor, it is possible 
in our model to have a strongly interacting Higgs sector, which was excluded by 
earlier lattice studies of the conventional model. 

(2) It is difficult to establish a bound for Higgs particle in our model, because 
by the time the Higgs is heavy enough, it would be impossible for us to define the 
scaling violations in our model. In fact, in our model, we believe the notion of the 
Higgs mass bound loses its meaning, unless some new nonperturbative definition is 
provided. 

Many interesting theoretical issues remains unsolved for our higher derivative 
O(N) model. For example, can this theory incorporate a techni-rho-like resonance 
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in the isospin 1 channel? This is obviously a nonperturbative problem. To answer it, 
we have to extract the phase shift in the isospin 1 channel for the higher derivative 
theory. We are still working on this issue. 
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